1 CAACCATTGC AGATCAGTGT GTGAGGGAAC TGCCATCATG AGGTCTGACA 
51 AGTCAGCTTT GGTATTTCTG CTCCTGCAGC TCTTCTGTGT TGGCTGTGGA 
101 TTCTGTGGGA AAGTCCTGGT GTGGCCCTGT GACATGAGCC ATTGGCTTAA 
151 TGTCAAGGTC ATTCTAGAAG AGCTCATAGT GAGAGGCCAT GAGGTAACAG 
201 TATTGACTCA CTCAAAGCCT TCGTTAATTG ACTACAGGAA GCCTTCTGCA 
251 TTGAAATTTG AGGTGGTCCA TATGCCACAG GACAGAACAG AAGAAAATGA 
301 AATATTTGTT GACCTAGCTC TGAATGTCTT GCCAGGCTTA TCAACCTGGC 
351 AATCAGTTAT AAAATTAAAT GATTTTTTTG TTGAAATAAG AGGAACTTTA 
401 AAAATGATGT GTGAGAGCTT TATCTACAAT CAGACGCTTA TGAAGAAGCT 
451 ACAGGAAACC AACTACGATG TAATGCTTAT AGACCCTGTG ATTCCCTGTG 
501 GAGACCTGAT GGCTGAGTTG CTTGCAGTCC CTTTTGTGCT CACACTTAGA 
551 ATTTCTGTAG GAGGCAATAT GGAGCGAAGC TGTGGGAAAC TTCCAGCTCC 
601 ACTTTCCTAT GTACCTGTGC CTATGACAGG ACTAACAGAC AGAATGACCT 
651 TTCTGGAAAG AGTAAAAAAT TCAATGCTTT CAGTTTTGTT CCACTTCTGG 
701 ATTCAGGATT ACGACTATCA TTTTTGGGAA GAGTTTTATA GTAAGGCATT 
751 AGGAAGGCCC ACTACATTAT GTGAGACTGT GGGAAAAGCT GAGATATGGC 
801 TAATACGAAC ATATTGGGAT TTTGAATTTC CTCAACCATA CCAACCTAAC 
851 TTTGAGTTTG TTGGAGGATT GCACTGTAAA CCTGCCAAAG CTTTGCCTAA 
901 GGAAATGGAA AATTTTGTCC AGAGTTCAGG GGAAGATGGT ATTGTGGTGT 
951 TTTCTCTGGG GTCACTGTTT CAAAATGTTA CAGAAGAAAA GGCTAATATC 
1001 ATTGCTTCAG CCCTTGCCCA GATCCCACAG AAGGTGTTAT GGAGGTACAA 
1051 AGGAAAAAAA CCATCCACAT TAGGAGCCAA TACTCGGCTG TATGATTGGA 
1101 TACCCCAGAA TGATCTTCTT GGTCATCCCA AAACCAAAGC TTTTATCACT 
1151 CATGGTGGAA TGAATGGGAT CTATGAAGCT ATTTACCATG GGGTCCCTAT 
1201 GGTGGGAGTT CCCATATTTG GTGATCAGCT TGATAACATA GCTCACATGA 
1251 AGGCCAAAGG AGCAGCTGTA GAAATAAACT TCAAAACTAT GACAAGCGAA 
1301 GATTTACTGA GGGCTTTGAG AACAGTCATT ACCGATTCCT CTTATAAAGA 
1351 GAATGCTATG AGATTATCAA GAATTCACCA TGATCAACCT GTAAAGCCCC 
1401 TAGATCGAGC AGTCTTCTGG ATCGAGTTTG TCATGCGCCA CAAAGGAGCC 
1451 AAGCACCTGC GATCAGCTGC CCATGACCTC ACCTGGTTCC AGCACTACTC
1501 TATAGATGTG ATTGGGTTCC TGCTGACCTG TGTGGCAACT GCTATATTCT 
1551 TGTTCACAAA ATGTTTTTTA TTTTCCTGTC AAAAATTTAA TAAAACTAGA 
1601 AAGATAGAAA AGAGGGAATA GATCTTTCCA AATTCAAGAA AGACCTGATG 
1651 GGGTAATCCT GTTAATTCCA GCCACATAGA ATTTGGTGAA AACCTTGCTA 
1701 TTTTCATATT ATCTATTCTG TTATTTTATC TTAGCTATAT AGCCTAGAAT 
1751 TCCATGATCA TGAGGTTGTG AGTATATCTC ATTCTTTCGT TGCATTTTCC 
1801 TAGGTGTGCT TACTCTCTTC TCTCACTTTG TGACACAAGG ACATGAATAC 
1851 ATCTAAATTT TCCTATTTCT GATATCACTG TTTCCATGAC GTCATTACTT 
1901 CTCTAACCTT AAGTGATAGG GTGACCTGCA ATATGCTGAT TCCTGGTGTT 
1951 TGCACAAACA CATGGATGTA AAGAAGTAAA AAATGTAAAA TTCACAAAAT
2001 TCAGTAAACC ACACAAATCA ATGAAGCATT CTATGACATT AGCTTGTTAT 
2051 GAGTAACATA ATGATTTTTC TTTTTCAATT TAAATAAGCC CTTCTACATA 
2101 CCCAGCATTA CTGATCTCAG ACAATGAATT GCTAAAAATG ACGATAGGGC 
2151 ATTACACTCA GAATAGTTTG CTATATTTCC ACATACCTCA TCTAGATGTC 
2201 ATAGCCTACA TTTCTGCCAT CACTTAACTG ACATTTTTTG TGTGTTCTTG 
2251 ATGATAAATA GACAGTTCTT ATTATTGTCC TCAAATAATA AAAGAAACTG 
2301 AAATTTTCTT ACATAGAGAA AATGTCCATA AGATATTCAA GTTAAACAGA 
2351 TTATTTTGAG ATAAGTAACC ATTAGAAATA TGTGATTGTA ATTTCTGATT 
2401 TTATAAAATT TTAATTGATA GTACACTTGA TTTAAATGTC TATTCTTTAA 
2451 AATGATGAAT ACTCATAATT CTTATCTCTA TAATCAAAAG TATAATTTAC 
2501 TGTAGAAAAA TAAAGAGATG CTTGTTCTGA AAGTAAAAAA AAAAAAAAAA 
2551 AAAACACTGT CATGCCGTTA CGTAGCGTAT CGTTGACAGC CCACTGTCAT 
2601 GCCGTTACGT AGCATATCGT TGACAGCGAC ACTGTCATGC CGTTACGTAG 
2651 CGTATCGTTG ACAGCACTGT CATGCGTTAC GAGCGTATCG TTGACAGCAC
2701 TGTCATGCCG TTACGTAGCG TATCGTTGAC AGCAAAACAC TGTCAGCCGT 
2751 TACGTAGCG (SEQ ID NO:1)
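
The numbered, 10-base-block layout used for SEQ ID NO:1 can be flattened into one continuous string before any position-based work. The following is a minimal sketch, not part of the original listing; the function name `flatten_listing` is hypothetical, and it assumes every row starts with a position counter (possibly split by a stray space) followed by blocks of bases.

```python
import re

def flatten_listing(lines):
    """Join numbered 10-base blocks into one uppercase sequence string."""
    parts = []
    for line in lines:
        # Drop the leading position counter, even if it was split ("14 51").
        body = re.sub(r"^\s*[\d\s]*", "", line)
        # Drop a trailing "(SEQ ID NO:n)" label if the row carries one.
        body = re.sub(r"\(SEQ ID.*$", "", body)
        # Keep letters only, which removes the spaces between blocks.
        parts.append(re.sub(r"[^A-Za-z]", "", body))
    return "".join(parts).upper()

rows = [
    "1 CAACCATTGC AGATCAGTGT GTGAGGGAAC TGCCATCATG AGGTCTGACA",
    "51 AGTCAGCTTT GGTATTTCTG CTCCTGCAGC TCTTCTGTGT TGGCTGTGGA",
]
cdna_start = flatten_listing(rows)
print(len(cdna_start))      # 100
print(cdna_start[37:40])    # ATG (bases 38-40, 1-based)
```

Because Python slices are 0-based, base 38 of the listing is index 37 of the flattened string.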



FEATURES: 

5'UTR: 1-37 

Start Codon: 38 

Stop Codon: 1619 

3'UTR: 1622 
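
The feature coordinates above can be checked against the length of the encoded protein with simple arithmetic. This sketch assumes that "Stop Codon: 1619" gives the first base of the stop codon (consistent with the 3'UTR starting at 1622); the helper name `cds_summary` is hypothetical.

```python
def cds_summary(start, stop_first):
    """Return (CDS length in bases, leftover bases, encoded residues)."""
    cds_end = stop_first + 2          # last base of the stop codon
    length = cds_end - start + 1      # 1-based inclusive span
    residues = length // 3 - 1        # the stop codon encodes no residue
    return length, length % 3, residues

print(cds_summary(38, 1619))          # (1584, 0, 527)
```

A remainder of 0 means the start and stop codons are in the same reading frame, and 527 residues agrees with the length of SEQ ID NO: 2.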

Homologous proteins:

Top 10 BLAST Hits 

                                                                      Score  E-value
CRA|147000022596013 /altid=gi|10438148 /def=dbj|BAB15179.1| (AK...    931    0.0
CRA|1000682322899 /altid=gi|5802604 /def=gb|AAD51732.1| (AF1752...    795    0.0
CRA|335001098690982 /altid=gi|11436851 /def=ref|XP_003547.1| UD...    679    0.0
CRA|18000005226060 /altid=gi|5803213 /def=ref|NP_006789.1| UDP        677    0.0
CRA|18000005155438 /altid=gi|4507821 /def=ref|NP_001068.1| UDP        676    0.0
CRA|18000005147353 /altid=gi|6175083 /def=sp|P06133|UDB4_HUMAN        675    0.0
CRA|18000004953169 /altid=gi|484383 /def=pir||JN0619 glucuronos...    674    0.0
CRA|18000005148770 /altid=gi|3153832 /def=gb|AAC95002.1| (AF064...    674    0.0
CRA|18000004903910 /altid=gi|4507819 /def=ref|NP_001067.1| UDP        669    0.0
CRA|18000005164461 /altid=gi|3426332 /def=gb|AAC32272.1| (AF081...    668    0.0
CRA|1000682327923 /altid=gi|5881246 /def=gb|AAD55093.1|AF180322...    668    0.0
CRA|18000005219476 /altid=gi|8134780 /def=sp|Q9XT55|UDBJ_MACFA        667    0.0

BLAST dbEST hits:

                                            Score  E-value
gi|10971169 /dataset=dbest /taxon=96...     383    e-103
gi|11068678 /dataset=dbest /taxon=96...     234    6e-59
gi|679005 /dataset=dbest /taxon=9606 /...   212    2e-52
gi|3173232 /dataset=dbest /taxon=9606 ...   212    2e-52
gi|3134358 /dataset=dbest /taxon=9605 ...   212    2e-52
gi|10298020 /dataset=dbest /taxon=96...     200    8e-49
gi|11974507 /dataset=dbest /taxon=96...     196    1e-47
gi|11973717 /dataset=dbest /taxon=96...     172    2e-40
gi|12673874 /dataset=dbest /taxon=96...     137    1e-29
gi|10887798 /dataset=dbest /taxon=96...     125    4e-26

EXPRESSION INFORMATION FOR MODULATORY USE:

library source:

Expression information from BLAST dbEST hits:

gi|10971169 Kidney hypernephroma
gi|11068678 HepG2 cell line
gi|679005 Liver
gi|3173232 Kidney
gi|3134358 Kidney
gi|10298020 Hepatocellular carcinoma
gi|11974507 Normal pigmental retinal epithelium
gi|11973717 Normal pigmental retinal epithelium
gi|12673874 Kidney hypernephroma
gi|10887798 Kidney

Expression information from PCR-based tissue screening panels:
Human fetal liver
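
The E-values listed for the BLAST hits use BLAST's compact notation, in which a bare exponent such as e-103 stands for 1e-103. Python's `float()` does not accept the bare form, so a small normalising step is needed before the values can be sorted or filtered. This is a sketch only; `parse_evalue` is a hypothetical helper name.

```python
def parse_evalue(text):
    """Convert a BLAST E-value string to float; 'e-103' is read as 1e-103."""
    text = text.strip().lower()
    if text.startswith("e"):
        text = "1" + text             # supply the implicit mantissa
    return float(text)

# Values copied from the hit lists in this listing.
for raw in ["0.0", "e-103", "6e-59", "4e-26"]:
    print(raw, "->", parse_evalue(raw))
```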




1 MRSDKSALVF LLLQLFCVGC GFCGKVLVWP CDMSHWLNVK VILEELIVRG 
51 HEVTVLTHSK PSLIDYRKPS ALKFEVVHMP QDRTEENEIF VDLALNVLPG 
101 LSTWQSVIKL NDFFVEIRGT LKMMCESFIY NQTLMKKLQE TNYDVMLIDP 
151 VIPCGDLMAE LLAVPFVLTL RISVGGNMER SCGKLPAPLS YVPVPMTGLT 
201 DRMTFLERVK NSMLSVLFHF WIQDYDYHFW EEFYSKALGR PTTLCETVGK 
251 AEIWLIRTYW DFEFPQPYQP NFEFVGGLHC KPAKALPKEM ENFVQSSGED 
301 GIVVFSLGSL FQNVTEEKAN IIASALAQIP QKVLWRYKGK KPSTLGANTR
351 LYDWIPQNDL LGHPKTKAFI THGGMNGIYE AIYHGVPMVG VPIFGDQLDN 
401 IAHMKAKGAA VEINFKTMTS EDLLRALRTV ITDSSYKENA MRLSRIHHDQ
451 PVKPLDRAVF WIEFVMRHKG AKHLRSAAHD LTWFQHYSID VIGFLLTCVA 
501 TAIFLFTKCF LFSCQKFNKT RKIEKRE (SEQ ID NO: 2) 



FEATURES:

Functional domains and key regions:

[1] PDOC00001 PS00001 ASN_GLYCOSYLATION
N-glycosylation site



Number of matches: 3 

1 131-134 NQTL 

2 313-316 NVTE 

3 518-521 NKTR 
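
The N-glycosylation matches above follow the PROSITE PS00001 pattern N-{P}-[ST]-{P} (asparagine, any residue except proline, serine or threonine, any residue except proline). A minimal sketch of the same scan with a regular expression follows; the function name `find_n_glyc` and the `offset` parameter are illustrative assumptions, and the fragment is residues 121-140 of SEQ ID NO: 2.

```python
import re

# PS00001 written as a regular expression: N, not P, S or T, not P.
N_GLYC = re.compile(r"N[^P][ST][^P]")

def find_n_glyc(peptide, offset=1):
    """Return (start, end, motif) tuples with 1-based inclusive positions."""
    return [(m.start() + offset, m.end() + offset - 1, m.group(0))
            for m in N_GLYC.finditer(peptide)]

fragment = "LKMMCESFIYNQTLMKKLQE"     # residues 121-140 of SEQ ID NO: 2
print(find_n_glyc(fragment, offset=121))   # [(131, 134, 'NQTL')]
```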



[2] PDOC00004 PS00004 CAMP_PHOSPHO_SITE

cAMP- and cGMP-dependent protein kinase phosphorylation site

Number of matches: 2

1 67-70 RKPS 

2 340-343 KKPS 



[3] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 



Number of matches: 6 

1 3-5 SDK 

2 120-122 TLK 

3 169-171 TLR 

4 200-202 TDR 

5 435-437 SYK 

6 520-522 TRK 



[4] PDOC00006 PS00006 CK2_PHOSPHO_SITE
Casein kinase II phosphorylation site

Number of matches: 9

1 62-65 SLID

2 141-144 TNYD

3 204-207 TFLE

4 243-246 TLCE

5 258-261 TYWD

6 296-299 SSGE

7 297-300 SGED

8 419-422 TSED

9 435-438 SYKE
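
Matches 6 and 7 of the casein kinase II list (296-299 SSGE and 297-300 SGED) overlap by three residues. A plain regular-expression scan reports only non-overlapping hits, so reproducing such a list needs a zero-width lookahead. The sketch below illustrates this for the PROSITE PS00006 pattern [ST]-x(2)-[DE]; the function name `ck2_starts` is hypothetical, and the fragment is residues 291-300 of SEQ ID NO: 2.

```python
import re

def ck2_starts(peptide, offset=1, overlapping=True):
    """Return 1-based start positions of [ST]-x(2)-[DE] matches."""
    # The lookahead form matches at a position without consuming residues,
    # so a second site may begin inside the first one.
    pattern = r"(?=[ST]..[DE])" if overlapping else r"[ST]..[DE]"
    return [m.start() + offset for m in re.finditer(pattern, peptide)]

fragment = "ENFVQSSGED"               # residues 291-300 of SEQ ID NO: 2
print(ck2_starts(fragment, 291, overlapping=False))   # [296]
print(ck2_starts(fragment, 291, overlapping=True))    # [296, 297]
```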



[5] PDOC00007 PS00007 TYR_PHOSPHO_SITE 
Tyrosine kinase phosphorylation site 

Number of matches: 2 

1 122-130 KMMCESFIY 

2 136-143 KKLQETNY

[6] PDOC00008 PS00008 MYRISTYL
N-myristoylation site

Number of matches: 4

1 19-24 GCGFCG 

2 276-281 GGLHCK 

3 373-378 GGMNGI 

4 377-382 GIYEAI 

[7] PDOC00009 PS00009 AMIDATION 
Amidation site 

338-341 KGKK 

[8] PDOC00359 PS00375 UDPGT 
UDP-glycosyltransferases signature

354-397 WIPQNDLLGHPKTKAFITHGGMNGIYEAIYHGVPMVGVPIFGDQ 

[9] PDOC00804 PS01047 HMA 
Heavy-metal-associated domain 

12-41 LLQLFCVGCGFCGKVLVWPCDMSHWLNVKV 



Membrane spanning structure and domains:

Helix  Begin  End  Score  Certainty
1      5      25   1.802  Certain
2      157    177  0.765  Putative
3      181    201  0.779  Putative
4      377    397  0.735  Putative
5      491    511  1.931  Certain

BLAST Alignment to Top Hit:

>CRA|147000022596013 /altid=gi|10438148 /def=dbj|BAB15179.1|
(AK025587) unnamed protein product [Homo sapiens]
/org=Homo sapiens /taxon=9606 /dataset=nraa /length=449
Length = 449

Score = 931 bits (2381), Expect = 0.0 

Identities = 448/449 (99%), Positives = 448/449 (99%) 



Query: 79  MPQDRTEENEIFVDLALNVLPGLSTWQSVIKLNDFFVEIRGTLKMMCESFIYNQTLMKKL 138
           MPQDRTEENEIFVDLALNVLPGLSTWQSVIKLNDFFVEIRGTLKMMCESFIYNQTLMKKL
Sbjct: 1   MPQDRTEENEIFVDLALNVLPGLSTWQSVIKLNDFFVEIRGTLKMMCESFIYNQTLMKKL 60

Query: 139 QETNYDVMLIDPVIPCGDLMAELLAVPFVLTLRISVGGNMERSCGKLPAPLSYVPVPMTG 198
           QETNYDVMLIDPVIPCGDLMAELLAVPFVLTLRISVGGNMERSCGKLPAPLSYVPVPMTG
Sbjct: 61  QETNYDVMLIDPVIPCGDLMAELLAVPFVLTLRISVGGNMERSCGKLPAPLSYVPVPMTG 120

Query: 199 LTDRMTFLERVKNSMLSVLFHFWIQDYDYHFWEEFYSKALGRPTTLCETVGKAEIWLIRT 258
           LTDRMTFLERVKNSMLSVLFHFWIQDYDYHFWEEFYSKALGRPTTLCETVGKAEIWLIRT
Sbjct: 121 LTDRMTFLERVKNSMLSVLFHFWIQDYDYHFWEEFYSKALGRPTTLCETVGKAEIWLIRT 180

Query: 259 YWDFEFPQPYQPNFEFVGGLHCKPAKALPKEMENFVQSSGEDGIVVFSLGSLFQNVTEEK 318
           YWDFEFPQPYQPNFEFVGGLHCKPAKALPKEMENFVQSSGEDGIVVFSLGSLFQNVTEEK
Sbjct: 181 YWDFEFPQPYQPNFEFVGGLHCKPAKALPKEMENFVQSSGEDGIVVFSLGSLFQNVTEEK 240

Query: 319 ANIIASALAQIPQKVLWRYKGKKPSTLGANTRLYDWIPQNDLLGHPKTKAFITHGGMNGI 378
           ANIIASALAQIPQKVLWRYKGKKPSTLGANTRLYDWIPQNDLLGHPKTKAFITHGGMNGI
Sbjct: 241 ANIIASALAQIPQKVLWRYKGKKPSTLGANTRLYDWIPQNDLLGHPKTKAFITHGGMNGI 300

Query: 379 YEAIYHGVPMVGVPIFGDQLDNIAHMKAKGAAVEINFKTMTSEDLLRALRTVITDSSYKE 438
           YEAIYHGVPMVGVPIFGDQLDNIAHMKAKGAAVEINFKTMTSEDLLRALRTVITDSSYKE
Sbjct: 301 YEAIYHGVPMVGVPIFGDQLDNIAHMKAKGAAVEINFKTMTSEDLLRALRTVITDSSYKE 360

Query: 439 NAMRLSRIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRSAAHDLTWFQHYSIDVIGFLLTC 498
           NAMRLSRIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRSAAHDLTWFQHYSIDVIGFLL C
Sbjct: 361 NAMRLSRIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRSAAHDLTWFQHYSIDVIGFLLAC 420

Query: 499 VATAIFLFTKCFLFSCQKFNKTRKIEKRE 527 (residues 79-527 of SEQ ID NO: 2)
           VATAIFLFTKCFLFSCQKFNKTRKIEKRE
Sbjct: 421 VATAIFLFTKCFLFSCQKFNKTRKIEKRE 449 (SEQ ID NO: 4)
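
The "Identities" figure reported for an alignment is the number of aligned columns in which the query and subject residues are the same. A minimal sketch of that count follows, using a fragment of the Query/Sbjct rows that contains the single mismatch (T versus A); the function name `count_identities` is hypothetical. The last line shows that 448/449 gives 99 when the percentage is truncated to an integer, which is consistent with the 99% printed in the alignment header.

```python
def count_identities(query, sbjct):
    """Count aligned columns where both rows carry the same residue."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    # Gap columns ("-") never count as identities.
    return sum(q == s and q != "-" for q, s in zip(query, sbjct))

# End of the Query 439-498 / Sbjct 361-420 block of the top-hit alignment.
print(count_identities("DVIGFLLTC", "DVIGFLLAC"))   # 8
print(100 * 448 // 449)                             # 99
```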





>CRA|1000682322899 /altid=gi|5802604 /def=gb|AAD51732.1| (AF175221)
UDP glucuronosyltransferase UGT2A3 [Cavia porcellus]
/org=Cavia porcellus /taxon=10141 /dataset=nraa
/length=530
Length = 530

Score = 795 bits (2030), Expect = 0.0

Identities = 377/530 (71%), Positives = 435/530 (81%), Gaps = 3/530 (0%)

Query: 1   MRSDKSALVFLLLQLFCVGCGFCGKVLVWPCDMSHWLNVKVILEELIVRGHEVTVLTHSK 60
           M   K A   LLL L C G GFCGKVLVWPC+MSHWLN+K +LEEL+ RGHEVTVLT S
Sbjct: 1   MAPGKLASAVLLLLLCCAGSGFCGKVLVWPCEMSHWLNLKTLLEELVKRGHEVTVLTLSN 60

Query: 61  PSLIDYRKPSALKFEVVHMPQDRTEENEI---FVDLALNVLPGLSTWQSVIKLNDFFVEI 117
              IDY +  A  FEV+ +P D+     I   F++LA+NV+P +  WQS   L  FFV+I
Sbjct: 61  NLFIDYNRHPAFNFEVIPVPTDKNMSENILNEFIELAVNVMPTMPLWQSGKLLQQFFVQI 120

Query: 118 RGTLKMMCESFIYNQTLMKKLQETNYDVMLIDPVIPCGDLMAELLAVPFVLTLRISVGGN 177
              L + C + +YNQ+LMKKL+++ YDV++ DPVIPCG+L+AE+L VPFV  L+ S+G
Sbjct: 121 TEDLGLNCRNTVYNQSLMKKLRDSKYDVLVTDPVIPCGELVAEMLGVPFVNMLKFSMGHT 180

Query: 178 MERSCGKLPAPLSYVPVPMTGLTDRMTFLERVKNSMLSVLFHFWIQDYDYHFWEEFYSKA 237
           +E+ CG+LPAP SYVPVP+ GLT RMTF+ERVKN + SVLF FWIQ YDY FW++FYS+A
Sbjct: 181 IEKYCGQLPAPPSYVPVPLGGLTTRMTFMERVKNMVFSVLFDFWIQQYDYKFWDQFYSEA 240

Query: 238 LGRPTTLCETVGKAEIWLIRTYWDFEFPQPYQPNFEFVGGLHCKPAKALPKEMENFVQSS 297
           LGRPTTLCE +GKAEIWLIRTYWDFEFP+PY PNFEFVGGLHCKPAK LPKEME FVQSS
Sbjct: 241 LGRPTTLCEIMGKAEIWLIRTYWDFEFPRPYLPNFEFVGGLHCKPAKPLPKEMEEFVQSS 300

Query: 298 GEDGIVVFSLGSLFQNVTEEKANIIASALAQIPQKVLWRYKGKKPSTLGANTRLYDWIPQ 357
           GEDG+VVFSLGS+ +N+TEEKAN+IASALAQIPQKVLWRYKGKKP+TLG NTRL+DWIPQ
Sbjct: 301 GEDGVVVFSLGSMVKNLTEEKANLIASALAQIPQKVLWRYKGKKPATLGPNTRLFDWIPQ 360

Query: 358 NDLLGHPKTKAFITHGGMNGIYEAIYHGVPMVGVPIFGDQLDNIAHMKAKGAAVEINFKT 417
           NDLLGHPKTKAFITHGG NGIYEAIYHGVPMVG+PIF DQ DN+A MKAKGAAVE+N  T
Sbjct: 361 NDLLGHPKTKAFITHGGSNGIYEAIYHGVPMVGMPIFSDQPDNLAGMKAKGAAVEVTSIMNT 420

Query: 418 MTSEDLLRALRTVITDSSYKENAMRLSRIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRSA 477
           MTS DLL ALRTVI D +YKENAM+LSRIHHDQPVKPLDRA FW+EFVM HKGAKHLR A
Sbjct: 421 MTSADLLGALRTVINDPTYKENAMKLSRIHHDQPVKPLDRAAFWVEFVMHHKGAKHLRVA 480

Query: 478 AHDLTWFQHYSIDVIGFLLTCVATAIFLFTKCFLFSCQKFNKTRKIEKRE 527 (residues 1-527 of SEQ ID NO:2)
           AHDL+WFQ++S+DVIGFLL CVA+AI L TKC LFS Q F K  K  K+E
Sbjct: 481 AHDLSWFQYHSLDVIGFLLACVASAILLVTKCCLFSFQNFIKIGKRIKKE 530 (SEQ ID NO: 5)



Hmmer search results (Pfam):

Model Description Score E-value N

PF00201 UDP-glucoronosyl and UDP-glucosyl transferas 962.0 1.5e-285 1 

Parsed for domains: 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 

PF00201 1/1 24 525 1 507 [] 962.0 1.5e-285 
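
The seq-f and seq-t columns of the domain table give the first and last residue of SEQ ID NO: 2 covered by the Pfam model. The sketch below derives how much of the 527-residue protein the PF00201 domain spans. It assumes the whitespace-separated column order shown in the table header above; `domain_coverage` is a hypothetical helper name.

```python
def domain_coverage(line, protein_length):
    """Return (residues covered, fraction of the protein) for one domain row."""
    fields = line.split()
    seq_from, seq_to = int(fields[2]), int(fields[3])   # seq-f, seq-t
    covered = seq_to - seq_from + 1                     # inclusive span
    return covered, covered / protein_length

row = "PF00201 1/1 24 525 1 507 [] 962.0 1.5e-285"
covered, fraction = domain_coverage(row, 527)
print(covered, round(fraction, 3))    # 502 0.953
```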

1 TTCTAGAGGG TTGGAACAAC TTTTCCCTGA TACATTGCAT TTTTTTGATA
51 CCTTCAGTAC ATGTTAAACT GGCAACCACC AGTGAACTTT ACTCTTAAAA 
101 TATTAATTTT TAACTTCTGT GCTTATATTG TCATTTCAAC TCCTTGCTTA 
151 GTAACTACAA AACCATTGCA GATCAGTGTG TGAGGGAACT GCCATCATGA 
201 GGTCTGACAA GTCAGCTTTG GTATTTCTGC TCCTGCAGCT CTTCTGTGTT 
251 GGCTGTGGAT TCTGTGGGAA AGTCCTGGTG TGGCCCTGTG ACATGAGCCA 
301 TTGGCTTAAT GTCAAGGTCA TTCTAGAAGA GCTCATAGTG AGAGGCCATG 
351 AGGTAACAGT ATTGACTCAC TCAAAGCCTT CGTTAATTGA CTACAGGAAG 
401 CCTTCTGCAT TGAAATTTGA GGTGGTCCAT ATGCCACAGG ACAGAACAGA
451 AGAAAATGAA ATATTTGTTG ACCTAGCTCT GAATGTCTTG CCAGGCTTAT 
501 CAACCTGGCA ATCAGTTATA AAATTAAATG ATTTTTTTGT TGAAATAAGA 
551 GGAACTTTAA AAATGATGTG TGAGAGCTTT ATCTACAATC AGACACTTAT 
601 GAAGAAGCTA CAGGAAACCA ACTACGATGT AACGCTTATA GACCCTGTGA 
651 TTCCCCGTGG AGACCTGATG GCTGAGTTGC TTCCAGTCCC TTTTGTGCTC 
701 ACACTTAGAA CTTCTCTAAG AGGCAATATG GAGCGAAGCT GTGGGAAACT 
751 TCCAGCTCCA CTTTCCTATG TACCTGTGCC TATGACAGGA CTAACAGACA 
801 GAATGACCTT TCTGGAAAGA GTAAAAAATT CAATGCTTTC AGTTTTGTTC 
851 CACTTCTGGA TTCAGGATTA CGACTATCAT TTTTGGGAAG AGTTTTATAG 
901 TAAGGCATTA GGTAAGACAC TTTTGTTTTA TTTTTAATTT AGTTATCAAA 
951 AGAAATATTT TTAAAAATTG TCATACATTG TCTATGACAT ATATATGCAG 
1001 GTCAATGAGT TTTTTTAGAA AATGTTGTAG CTGTTTTTCA TAAAGAAAGT 
1051 GTATTTGTTC TAAGCGTAAG ATAACCTACT TTCTTAATAC CAGTAATATA 
1101 CTTAAAAATG ATCATCAATA ACTAAGAGAT TATATTTTGT ATTTCCTCCA 
1151 AATAGCGCAA ATCAACATCA CATATTTTTG AGAATCACTG ATTGTTAGTC 
1201 TGAATGTTAT AGAATTTCTA TTGAAATAAA ATGCTAATCA TTATTTTCTC 
1251 TCTCATCATG TATTTAAGAA AATCTTCAGA AGGTCTTCTT TGAATTAATT 
1301 TTTCAAGAGT CATTAAATTG AACATTTTCT AGAATTCTTT AATTTCTTAG 
1351 GTGATTACTT CACAAAAACT TGAAAAAATA TTATAAAAAG TTAAAAAACT 
1401 TACGGTCTTG TGGGGCATAA GATAGTAGAA TTTTTACTTT ACTGATATAC 
1451 ACCTATTTGA CTTATTTTTA TTTCTTTGCT TTACTGATAA AAAGTTGTTT 
1501 TGCTTTGCAA TTTTCATATA GTTGTGATCA GAGCTGGTCA ATGCAAGACA 
1551 TGTTTTTATC CAAATATGTT TGAGAATTAT GTAGAAACAT GAAAAAAGGT 
1601 ACAATTATAT CCGACACTAA AATATTGTTT AATGTATTCC AACGAATTCT 
1651 TATGCATAGA CTGTTTCACA GAACTAATAT TCAGAGGATC CCAGTTCAAA 
1701 TGTCCTTAGC CTTAGACATG ATTTGAATTT ACATATATTG ATTTGCTTTA 
1751 AATAATTTTC CATTCAGTAA GCTGTGCCTA GCTGCAGATA GCCTACCAGG 
1801 CTTTATGGAT CTAGGTAAAC AATACAAATC TCTTGGCCTC AAGTCTACAT 
1851 TCAGATATTA ATTTAAAGGG GTACAGCTAT ATAGAGGTCA CTGGCAAATT 
1901 TTGGTAAAAT AGGATTATAG TAAAAGCCCC CTGACAAGAT TGAAATTTAA 
1951 AATAAAACAA AAGTGTTATC AAAGGGGTGA AAGAGCATTT TCCAATAAAC 
2001 AAAAGTGGGT TCTGGCCATG CATTCAGAAA TTCCCCAACA ATTCTTTAAA 
2051 AATCATGGAG CAGCTTGATA TATAAGAAAT TCATTTAATA ACTATATTTA 
2101 TTATGTAGCT CCAACTTACT AAATTATTGA TTATTATATA TTTTATAGAA 
2151 TTATCTATTG TGAGTCTAAA TCAAGAGTAT ATATTCAAAC AACTATAGGA 
2201 AAAGGGATAT CAGTCAATTT CAATTCAAGG ATTTATTTCC ATAAGTGCTT 
2251 ACGCACAGGT GTATTTCATT TTATTATACA TTGCTTTATT GTCCTTCACA 
2301 AAAATTGCAA TTTACAAATT AAAGGTTTTT GAAAACCTTG AATCAAGCTA 
2351 ATCAATTTGG CGTAATATTT CCAACAACAA GTGTGTACTT TTGACTCTAT 
2401 CACATATTGG CATTTATCAT GCTTTTTCAA ATTTTTCATT GTTATATCTG
2451 TTACGGTGAT CTGGGATCAG TGTTCCTTGA TGGTTACACG TTTATTAGCT 
2501 TGGGGGCACC TTGATGTGTT ACAATATAAG ACAGCAAACT TAATTATAAA 
2551 TGTTGTGCAT GTACTAACTG CTCCGCTGAT TCGTTTCCCC ATCCCACTTC 
2601 TTCTTAGGCC TCCCTATTCC CTGAGACACA GTAATATAAC ATACAATGAC 
2651 TTCTAAATGT TCCAGTGAAA AGAAAAGTAG CAGGTCTCTC AATTTAAACC 
2701 AAAAATATAA AGGAATAAGT TTAATGAGTA CTATAGTTTA GATATGGTTT
2751 GCTTGACCCT ACAAAATCCT GTGTTGAAAT TTGATCACCA ATATTGGAGG
2801 TGGGGCTTGA TGGGAAGTGT TAGGGTCATG AGGGTAGATT CCTTATGAGT 
2851 ACATTAATGC TCTCCCTGGG GAAATGGGTG AGTTCGTTCT CACTCTATTA 
2901 GGTCCCAGGA GAGATAATTA TTAAAAAGAG CCAGGAACAT CCACCTTCTT
2951 TCTCTTGCAT ATCTCTCATT ATCTGATCCC TGCACTTGCT GGCTCCCAAC 
3001 ATCTTCTTCA ATGAGTGGAG GAAACCAGAG GTCTTCACCA GACACAGATG 
3051 TTGGTGCCAT GCCTCTTGTA TACCCTGAAG AATTGTGAGC CAAATAAAAA 
3101 CCTTTTTCTT TTACAAATTA GACAGCCTCA GTTATTCCTT TGTAGCAACA 
3151 AAAAAAGCCT GGGACAGGCC AAAAACTACA CCATTGCACC AAACAGTTAA
3201 ACAAGATGTG AGTGCAAAGG AAAAGTTTTT GGAGGAAATT AAAAGTGCTA 
3251 CTCCAGTGTA CATACAAATG ATAAGAACAA ATAACCATTA TCAGTGCTGA 
3301 TATGGAGAAA ATTTTAGTTG TCTGGAGAGA AAATCAAATT AGCTAGCCAG 
3351 CTGCAGTGAT TCATATCTGT AATCCCAGTA ACTTGGGAGG CTCAGGTGGG 
3401 AGAACGGCTT GAGCCCAGAA GTTTGAAGTC CAAGGCTGCA GTGAGCTATG
3451 ATTGCTCCAC TGCACTCCAA CCTAGGTGAT AGAGCAAAAC CACTACCAAA
3501 AAAAAAAAAA AAAAAAAGAA GAAAAAGAAA AGAAAAAAAA TTAAACCAAC 
3551 CACAACATCA CCTTAGGTTT TGGCATTAGC TAAAAACTAA TACATAGTAA 
3601 AGCGTTAACT ATTCAATTGC ATGAAACCTC AGAGAGGAGA GGAAGATGCA 
3651 GAAAAAAAGA CTGAAGCTAG TAGAGGTTGA CTAATGAGGT TTACAGGAAT 
3701 AAACTGCCTA CATGATGCAA AAGTTCAATG TGAAGCAATA GGAAGTCATG
3751 CAGAAGACTT AGCTAATATA CTCAGTAAAT GTGGCTACAG TAAACAAATG 
3801 ATTTTCAATG TAGACCTAAC AGCCTTCTGT TGGAAGAAGA TGCCATTTAA 
3851 AACTTTCATA GCTAGAGAAG AGAAGTCAAT GCTTGTCTCT GAAGCTACAA 
3901 AAAACAGGCT GAATCTCTTG TAGTGGCTAA TGCAGCTGAT GACAAAGGTA 
3951 AAGCCAATGC CCATTTACTT TTTGTAATAA TTATAGAGGA CTCTTAATAA 
4001 TTATGTTAAA TCTACTTTGC CTGTGTTATA TCAATGGAAC AACAAAGCCT
4051 GGATGATATC ACATTGGTAT ATGACATGGC TTATTGAATA TTTTAAGCAC
4101 ACTGTTGAGA CCTATTGCTC AAAAAAGAGG ATTCCTTTCA AAATATTGCT 
4151 GCTCATTGAC AATTCACATG GTCAACAAAG GGCTCTGATT AAGATGTACA 
4201 GATATTAATG TTTGCCTGCT TGCTATTATT ACATCCATCT TACATGGCAT 
4251 GGATCATATA GCCTTGACTT TCAAGTCTTA TGTAAGAAAT ATATTTTGTA 
4301 AGGCTATAGC TCTTACTAAT GGGGAAAGTA TATTGAAAAC CTTTTCAAAA 
4351 GGATTTTTCA TTCTAGATTC CATTAAGAAC ATTCATGGTT CATGAGAGGA
4401 AGTCAAGATA TTAACATTAA CAAGAGTTTG GAAAAAATTT GATTCTAACT 
4451 CTCCTGGATG ATTTTGAGGG ATTGAAGACA TCATGTGAAG AATTAACTGG 
4501 GGATGGGGTG GTCATGAAAA AATAAATAGA ATTATAAGTG GGCCTGAAGG 
4551 TTTGTCTAAA TTGCTATAAT ATCATGATAA AACTAAAACC TGTAAAACCG 
4601 GTGAGGAGGT GCTTTTTAAA CAGTTACTTT TTATAGATGA ACACAGAAAT
4651 TGGTTTTGTG AGTTGGAATC TTCTCCGAGT GAAAATGCTA TGAACATTGT
4701 TGAAATGGCT ACAAATGACT TAGAATATTA CACAAAATTA GTAGATAAGG
4751 CAGCATCAAG GTTTGAGAGA ATGGACTCAA ATTTTGAAAG AAATTCTACT 
4801 ATGGGTAAAC TGCTGTGAAA CATCATCATA TGCTACAGAG AAATCTTTCA
4851 TGAAAAGATG AGTCAATTCA TGCAACAATC TTTGTTGTCT AATTTTAAAA
4901 ATTGTCCAGC TGCCCTGATC AATCAACAGT AATCAGCACT GAGGCAAGAC
4951 CCTACACCAG AAAAAATAAA AATAAAAAAC CTCACTTGCT GAAGACTCAG 
5001 CTTATTATTA GCACTTTTTA GCCATACTTT TAACTAAGGT ATGTGCATTC 
5051 CTTTTTAAAC GTGATGATAT TGCACAGCTA ATAGCCTACA AGGTATGGTT 
5101 AACATAACTT TTATATGTCC TGGGACCCAA ATTTGTGTGA ATCACTTTAT 
5151 TGACATATTC CTTTTATTGA GATGAACTGC AACTTATCTT GCAATATCTC 
5201 CAAGATATGT GTGTATGGCA TTTCAAATAA GATGTGAAAT TATTTTATTA 
5251 GTATAAAAAG CAAATTTAAT TTTCTTTCCT TTGATCATCT TTATCCTTGT 
5301 TACTGTGTAT TTATCCTTTA AACATTGAAT GACTCCAATT GTTTAAAACT 
5351 GAGTCTTTCT TAAATGAGTC CTAATATCAT AGTAATTAAA ATCACCTACA 
5401 AGTTGGTAAT GCAGGCAGCA TGTGAGGCAC AGAAAACAAC AAATTTATAA
5451 GACATAAATG CATTTGCTTG GAAGCTGAGA GAAGGCTCTA TTCTAATTTC
5501 TGATAACTTC AAACTGAGTA TCTTCAGTAA AATTTATTCA CTATCAAATT 
5551 CAAGGCGTTT GGATTTATGA CCTAGGAAAA AACTTCAAAC ATTAAAATGT 
5601 GATGACCTTA AAAAGAGGCT CTCCACACTA TGGTGTATAA CACCACCAAC 
5651 TTTGATTAGA ATTTTAAAGA GAAACAAATT CTCTTATGGA GTTTATCTTT 
5701 TTATCACTTG CAAAATATGT TTTTGTAAAG AGATACTAAT TACTTAGTTA
5751 TTTGTAGTTA GCCATTCTTC TGATTAAAAA CCTAAAATTA AATCTTGAAA
5801 ATGTGTTTTC CTTCAAAACA CATCATTTGA GAGAAACACT AAAGTAAGTG 
5851 TATGATTATC ATAGCATGTA CATAGGTGCT TCACAACCCA AAAAGAATAT 
5901 TGTCATGGGT AAGAATCAGT AAAGGAATTT CTCCTAATAA AACAGTAGCC 
5951 TATTAATTAA AGTAATGATA TGCAATACAG CAAGTTAAAG GGAACTGATC 
6001 CTGGTGGGAT TATTGAAAGA TATACCCTTG ACTATAGATT AGAAAATACA 
6051 GAGATGTTAT TTAGTGAAGA TATTGTGGTA CTCATTTATC ATCTGCAATT 
6101 CACTTGCAGA GGAAAAAATG AGTAATAAAT TCATTTGCAT TTTGGATTTG 
6151 TGTCTTTAAG TTGTGAAAAT ACACTTAAAT ATAACCATCT GTCCTTTGCT 
6201 CCTTCCTTCC TTCCTTCTTT CCTTCCTTCC TTCCTTCTTT CCATCCTTCC 
6251 CTCCCTCCAT CCTTCCTTCC TTCCTTTCTT CCTTCCCTCC TTTCTTTCTC 
6301 TGTCCTTCCT TCTTTTTTCC TTTCTTTCTT TTTTCTTCTT TATTATTTCA
6351 TTAATTCCCC CTTCCATTTG ACGTCTA^AA GCCATGTTGT TCTAGAGGAC 
6401 TTAAACTTAT TTTTTTCTTA ATAGCTTACT GAAAAATTAG TGATACAATT 
6451 TTTTATTTGA ATTGTATGCT AATTCATTCT GTTATTTCTT TTATTGAGGA 
6501 AGGCCCACTA CATTATGTGA GACTGTGGGA AAAGCTGAGA TATGGCTAAT 
6551 ACGAACATAT TGGGATTTTG AATTTCCTCA ACCATACCAA CCTAACTTTG 
6601 AGTTTGTTGG AGGATTGCAC TGTAAACCTG CCAAAGCTTT GCCTAAGGTA 
6651 GGACTATTGT ATTAAGGAAT ATTATGTACT TTATGACATG ACTTGTTTTC 
6701 CCTTGAAAGA TTACAACCTT AGTTATAGAA GGATGATGTT GAATGTCGTC 
6751 TGTTTGCAGC TCCATATTTA TTTTCCATGC CACAGGGGCT CTTATAGGTG 
6801 ATTATATGTC TTTTCGGTAT TATATTGAGA AAGTAGGCAG AAGAATTTCA 
6851 TGATTAGAAT AGATTTTAAA ATACTAGTAT TACAATAGTT TGGATAATAA 
6901 ATTGAATTAA TAGGGAATTG GAGCCATGAA GATCACTAAA AAGAATGCTC 
6951 TAGCCTTTCT CACAATCAAA TTGGGCTTAT GAACAAGGAT ATTTGTCATG 
7001 ATAGTACAGA AATAAGCATA TTTTCATGAG ACATATTGGA TATATTCCAC 
7051 AGGAGTTGGT GAGTGAGAGA AAATAAGTGA TGAAGGAAGA CAAAGAATAA 
7101 AAGAAAATTT CAATAAATGG AAAGTTTAAG TGTTTAATGA TAGTGATGAC 
7151 TTTTACTCAA ATAAGTGCTT AGAAGTCATC TTGTTTGTGA TTTATATGAT 
7201 GAATTCTGTG TTGTGACTAT CCACTTTGAG CTCGTGAGAA TGTTAGGTGA 
7251 GGTTTAATAA AAGCCATTTG AGAAAAACAA GGTTTCAACC TCTGTGGACA 
7301 GAAATCTAAA TATCGATAGT TATCAGGACA AAGTAGAGCT CATAGAAATA
7351 ATTTTGCAGC CTGCAGGTTT GTTTTGGAGT GAAAATAAAA TTGTATACTA 
7401 TATTCCTAAA TCATCAGAGG AAAAAATTTA TAGTTCAAGG AATGTTGAAA 
7451 GAAACAATAT TGAGAAGTAA AAGTGAGTAA TAGTTGTTAT AGTTTTTTAA 
7501 TAGTTTTGTA AGTATGTCTT GAGTTCACTG TCCCAAAAGT GGCTATTAGC 
7551 TCTAGCCTTG ACCTGACAAG GTTCTAGGAT ATTTAGTCAT GGATGTTCAT 
7601 AATCTACCTC TTACGGGATA CTTTTTATTC TGATGAACAG CCTAATGCCT 
7651 AAGTGTGCAA TCTATACCAA GATTGTTCTT ATAGGGAACT TGTTTACACT
7701 GGAAGACACC ACTGTGTCTC TTGTATGACC TATGTCTTCT TTATCCCTAC
7751 AAAGGTAACC ACATTATAGG AAACCCTGAC AAGGCCAGAT GTTATATTTG 
7801 TGTTGGTCAA GTGAGAAAAC ATGGGAGAAA CTTAACCAAA CACATAAAAT 
7851 AACAGAAACA GTCTTCTTTG ACCATTTCTA GAGAAAAGAG TTCAGCATCC
7901 CTTGTAAGGC CACTAGGAAG AAGAAAATTC TCTGGGAAAA GCACATTCAA
7951 CCAATGAATG GAGACCAAGA AAGAGAGTGA GGGATCTATG TGCCAAAATG
8001 TTAACTGGGA TCCAGGGTGT TACCTAGGTG GGTTTCCAAT GGGGAACTGT 
8051 AATTGGTAGG TTTAATGCAA GCAGGCACAA AGTCCATGGA GGCATTCTGA 
8101 GACTGAAAGA TAGTCACTTT GGCATATCTG CACAGAATCT GATCAGTGAT 
8151 TCAAGCCCAA GTAGGCTGTA TCTAGTTGTC CTATAGGGTG GTTACCAGGA 
8201 GGCAGTGTGT AAGTAAAAAT CCTGACTGAA CACATTGAGG AAATGGAAGG 
8251 AGGTGGAAGA TTTTAAACGG TGTCAGTGTT GACTAAGACC TGCTTCTGGT 
8301 ATGGAAAATT CAACTTATAT TTTAAATGCA TAGCCAGACA ACATAAAATT 
8351 ATAAGAATTT ACCACAATAG CTATGGTAAC AATACTGGGT TTACCTATTA 
8401 CTACAGAGTG AAAAGAAAAC CCTCATTTCC CATTTTATGG AAATATAATC 
8451 AAAATCCTAT AAGGAAGGTT TCAGAGCCAG TAGGATTTCC AGAAAAATTA 
8501 TTGGTTTTAT AGTAAGATGT GTATTGATGA ATATAATTTT ATTTATTAAT 
8551 TATTAATATC ACTTTACTTA CCAGGAAAGT TATACCAGAA AACCAAGCTC 
8601 TCTTAAGCCA TGGCATCTGT ATCTAAAATA GAAATACAGA AGGAGAGCTG 

8651 ACAATTTCCA TCATTCTCTA GGTAATCTCC CATGCCATTC TACCCTTTAT
8701 TCCCACACTC CCAGTTTTAC ACACACACAC AAACACACAC ACACAAACAC 
8751 ACACTCATAG AAATAATCAT AGAAGACATA TTTTTAAAAA AGTTAGATCC 
8801 ATACAGTAAT AATTTATTAG GTAAAAGCTT TTGTGCTGAT AATTTTACAA 
8851 GTTTAATTGA GATATATTTT AGGGCTGTCT TACACTAAAT ATTTATTTTT 
8901 ATTTTTTAAA TTTGACATGT AATAATTGCA CATGTTTAAG AGAAATGCTG 
8951 TGGTATTACA ATACATTTAA ATGTTGTGTA ATAATTACAT CAAGATAATA 
9001 AACCCATCAT CTAAATATTT ATCATTTCTT TGTGGTGATA ACATTCAAAA 
9051 ACCTCCTTTC TGGCTATCTT GAAATATGTA ATACATTACT ATTAACTATA 
9101 GTTACCCAAC AACTTAATAT AATAACAGAA CATATTCTTC CAAATTTAAA 
9151 CGTTGTATCC ATTGATCCAC CATTTCTCAT TGCCCTCCCT ACTATCTCTT 
9201 CAGCCTCTAG TAACCACAAT TCTACTCTCT AATTATATTA TGAATGCATT 
9251 TTTTGATTCC ACATATAAGG GATACCATGC TATCTCTGCC TGGATTATTT 
9301 CAGTTAACAT TATGCCCTGG AGGTTCATTC ATGTTTCTAC AAATGACAGG 
9351 ATTTCATTCT TTTTTTTCCA ATATATATTT AATGAAATGG ATATATATAA 
9401 ACATTGGAAA ATGTATATAT ATATATATAT CTCCAGTGGA ATGCTATTGA
9451 GCTATAAAAA AGTTAATATA TAATAGAAAT AAAGCTTATA TATATCTAAT
9501 GGAATGGATA TATATATATA ATGGAATAGA AATATATATC TATACATATA 
9551 AACACACGCA ATATACATAT CCATTTCATT GCATATATAT ATATATAGAG 
9601 AGAGAGAGAG AGAGATATTT TCAAATGTGT GTATATATAT CCAATGGAAT 
9651 GGACATATAT ATATGTATAT TTTTTCCATA TTTTCTTTAT GTATTTCTTC 
9701 ATTAATGGAT GTTTAGGTTG ATTCATCCCT TGGGTATATG AATAATGTTG
9751 ATGTAAACAT AGAAGGACAG ATATCTCTAT GACTTCTTAG TTTATTTAAA 
9801 TATACACCCA GTAATGGAT^ TGCTGTATAA TATGGTAGTT CTATTTTCAT 
9851 TTTTTGAGGA ACTACCATAC CGTTTTCCTT ACTAATTGTA CTAATTTGCA 
9901 TTTCCCTCAA CAGTTTATAA AAGATCTTCT TTCTCTGCAT ACTTTCTAGC 
9951 ACTTGTTATT TTTGCCTTTT GATAATAGCC ATAACAGGGG TGATGTGATA 
10001 TCTCATTGTA GTTTTGATTT GCATTTCCCT GATGATTAGT GATTTTGAGC 
10051 ATTTTGTAAT TATACTTCTT AGTCACTGAT AGTCTTCTTT TGAGAAGTGT 
10101 CTATTCAGGT CTTTTGCTTA TTTTTTAATC AAATTAGTAA TTTATTTTTA 
10151 TTGACTGATG TGACTTCTAT GTATATTTGA GATAGTAACT TATTGTCAGA 
10201 TTCATAGTTT GCAAATATTT TTCATGTTGT GAATTGTCTC TTCACCCTGT 
10251 TGTTTGCTTC ATTTTCTCTG CACAAGCTCA ATGCTTTGAT ATAACCCATT 
10301 TATCTACTTT TCCTTTTGTT GGCTGTGCTT CTGAAGTCCT ATCCAAAAAA 
10351 ATCCTTGCCT AGACCAATGT CACAAATCAT TCCTCCTACA GTTTCTTCTA 
10401 GTAGTTGTAT AATGTTTGGC CTTATATTTA ACTTTGTAAT TCATTTTTAC
10451 TTACTTTGTA TATGGTGAGG GATAGAGGTC TAGTTTCATT TTCTGCATGT 
10501 GGATATGCAG TTTTCCTAGC ACCATTTAGT GAAGAGGTTG CCTTTTTTCT 
10551 ATTATGTGTT CTTGGCACCT TTGTCAAAAG TCAGTTAGCT GCTATATTCC 
10601 TCCATTTGTG TTGTTATAGA GGAACACATG AGACTAGCAA ATTTATATAT 
10651 CAAATAGAAT TATTTGAATG ATAGTTCTGC ATACTGTACA AGAAGGACAG 
10701 CACTGACTTC TGCTTGGCCT CTGGTAAGGT TCTCAAGATG CTTCCACTTG 
10751 TGGTAGAAGG CAAACATGAG CTGGTATATG CAAAGGTCTC ATGACAAGAG 
10801 AGGAAACCAT AAAGAGGGGA TGTGAGGGAG TGCCAGGTTT TGTAAAACAA 
10851 CTAGCTCTTC TGGGAACTAA TAGAGTAAAA ATTCGCCTCC CAGGCAGGGG 
10901 ATTAATCTAT TCATGAGGGA TCTGCTTCCA TGACAAAGGC ACATTCTGTT
10951 AGATTCTACC CCCAATATTG GGGATCAAAT TTTAACATGA AGTGTGGAGG 
11001 GCTCAAATAT CCATACTATG GCAGCAGTAA ATGCATAAAT TTATTTTGTG 
11051 GATCTCTATT CTATATAGTA TTGGTGTATG TATCTGTTTT CATGCCACTG 
11101 CCATACTGTT TTGGTGATGA TATCTATGCT ATATATGTGT GTGTGTATAT 
11151 ATATATTATA TATATGTATA TATGTGTATA TTATATATAT GTATATATGT 
11201 GTATATTATA TATATATAAT ACTTTAAGTT TTATATATAT ATAAAATACT 
11251 TTAAGTTCAA GGGTACATGT GCAGGATGTG CAGGTCAGTT ACATAGGTAT 
11301 ACATGTGCCA TTTTGGTTTG CTGCATGCAT CAACTCATCA TTACATTAGG 
11351 TATTTCTCCT AATGCTATCC CTCCACCAGC CACCCAACCC CCAACAGGCC 
11401 AGGTGTGTGA TGTTCCCCGC CCTGTGTCCA TGTGTTCTCA TTGTTCACTT 
11451 CCTACCTAAA AGTGAGAACA TGCAGTGTTT GATTTTCTAT CCTTGTGATA 
11501 GTTTGCTGAG AATGACTGTT TTCAGCTTCA TCCATGTCCC TCAAAAGGAC 
11551 ATGAACTCAT CCTTATTTAT GGCTGCATAG TATTCCATGG TGTATATGTG 
11601 CTACGTTTTC TTAATCCAGT CTATCACTGT TGGACATTTG GGTTGGTTCC 
11651 AAGTCTTTGC TATTGTGAAT AGTGCTACAA TAACCATATG TGTGCATGTG 
11701 TCTTTATAGC AACATGATTT ACTATCCTTT GTGTACATAC CCAGTAATGG 
11751 GATAACTGGG TCAAATGGTA TTTCTAGTTC TAGATCCTTG AGGAATCCCC 
11801 ACACTGTCTT CCACAATGGT TGAACTAATT TACATTCCCA CCAACAGTGT 
11851 AAAAACGTTC CTATTTCCCC ACATCCTCTC CAGTATCTGT TGTTTCCTGA 
11901 CTTTTTAATG ATGGCCATTC TAACTCACAT GAGATGGTAT CTCATTGTGG 
11951 TTTTTGTTTG CATTTCTCTG ATGACCAGTG ATGATGAGCA TTTTTTCATG 
12001 TGTCTTTTGG CTGCATAAAT GTCTTCTTTT GACAAGTGTC TGTTCATATC 
12051 CTTTGCCCAC TTTTCAATGG AGTTGTTTGT TTTTTTCCTG TAAATTTGTT 
12101 TAAGTTCATT GTAGATTCTG GATATTAGCC CTTTGTCAGA TGGGTAGATT 
12151 GCAAAAATTT TCTCCCATTC TGTAGGTTGC CTGTTCACCC TGATGGTAGT 
12201 TTCTTTTGCT GTGCAGAAGC TCTTTAGCTT AATTAGATCC CATTTGTCAA 
12251 TTTCGGCTTT TGTTGCCATT GCTTTTGGTG TTTTAGTCAT GAAACCCTTG 
12301 CCCAGGCCTA AGTCCTCAGT GGTATAGCCT AGGTTTTCTT CTAGGATTTT 
12351 TATGGTTTCA GGTCTAACAT TTAAGTCTTT AATCCATCTT AAATTAATTT 
12401 TTGTATAAGA TGTAAGAAGG GATCCGTTTC AACTTTCTAC ATATGGCTAG 
12451 CGTGTTTTCC CAACACCATT TATTAAATAG GGAATCCTTT CTCCATTTCT 
12501 TGATTTTGTC ATATTTGTCA AACATCACAT GGTTAGAGAT GTGTAGTGTT 
12551 ATCACTGAGG CCTCTTTTCT GACTCCATTG ATCTATATAT CTGTTTTGAT 
12601 ACCAATACCA TGTTGTTTTC GTTACTGCAA CCTTGTAATG CAATTTGACA
12651 TTCAGGACCA TGATGCCTCC AGTCTCTTTT TTTTTTTCTA AATAATTTTT
12701 TTGTCAATGT AAGCTCATTT TCGCTTCTTT CTGATCCATA AAGTATTTTT 
12751 TTCCCATTTT GTGGAGAACG CCGCNNNNNN NNNNNNNNNN NNNNNNNNNN 
12801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNGGCACA CCTCGTGCGC 
12851 ATATATATAT ATATATATAT ATATACCTCT ATATATATAT ACATACATAC
12901 ATACATACAC ACCTCCTTGT CTGGTGTGGG ATCAGGGTAA TGCTAGCCTC 
12951 ACAAGATGAT ACTGAAGTGT TTTTGCCTTT TTGACTTTTT GATGGTTTGG 
13001 AAGAGTGAGA AAAAGTGTTA TTAATTATTC TTTAAATTTT GTTGAATTTC 
13051 ATAGTGAAGA CCTTAGCTCA CTGGCTTTTT TAATGAGAAC TTTATTACTG 
13101 ATTTAAACTT CTTCTTCATT ATTTATTTCT GCCTTGTTTT TATTTCTTCA 
13151 TAATCCAGTC CTATTTTATG TGTCCACTAA ATTGTTTATT TTCCTAGAAT 
13201 TTTTCCATTT ATTGGCATAT GCATGTCCAT AGAAGCCTTT TATAGTCCTT 
13251 TTCATTTCTA GTGTCATTTT TTTCCTTTTT TTTAAGAATC CTTAAGATTT 
13301 TAGAGATGAA ATGTCACTTT GTTACGCATA CTGGAGTGCG GTGACATTAT 
13351 TATAGCTCAC TGAAACCCAA ACTCCTGAGT TTAAGCAATC CTTCTACCTC 
13401 AAAATTCCAA AATTCCTGAG TAGCTGAGAC AGGCATACAC CATCAAGACT 
13451 GGCTAATTTA TTTCAAATTT TGTAGAGATG GGTTCTTACT AAGCTATTCT 
13501 CAATCTTTGG GCTTCAAGTG ATTCTTCAGC CTCTGTCTCT GAAAATGCTG 
13551 GGTTTATAGA TATGAGCCTC TATGCCTGAT TTGCTTTGTC TCTTTGTAAT 
13601 CTCCCATTTT ATTTGTGTCT TTTCTGGTTT GTTTCATTTT GTTATGTTTT 

13651 CAGTTACCTT GCTAAAGCTT TGTCGATTTT ATCTCTTCAA ACAACTAACT
13701 CAATATTTTG CTGATTTTCC ATATAGTATT TTATTTCTAT TTCATTTATT 
13751 TCTGCTCTAA TCTTTGTTAA ATATCTTGTT TTCCTAATAA TTTTGAGTTT 
13801 CCTTGTTCTT GTTTTCTAAT TCCTTGCGAT GTTATCATAA ATTGTTTATT 
13851 TGATATCTTT CTACTTTTTT GATGTGTGTG TTCGTTGTTG TAGACTTTCC 
13901 TCTTTATTAT TCTGATTTCT TCCTCAATTC TCTAATATTA TGATTGCATT 
13951 ATTTTCCAAG TTTCTTTTGT TTTTTTATTT ATAGTTTATG TGATTCCTGA 
14001 ACTTGTCAAA GAGATTATTG TGAATTTGAT GTCGGATATT TAAGCATTTT 

14051 CAAAACTTTT GGTGCATTAT TGAAATTTTA TTGGTTTATT TTAGAGATGT
14101 CATACTTCCC AGTTTTTTTT TAACAATACT TGCTCTTTAT ATTGATGTCT 
14151 ACATATTTAA AAAGATAACC ACCTGATTCA GCTTTTTAAG GTGATATGCA 
14201 GTGGTGTTAA GTGTGTACTG CTTAATATCA GAGCTGAATC ACTGCCCTGA 
14251 GGATTCTTTC TGTTCTGAGG AGAGCTTGTA GTTAATAGCA GAACCTAAAT 
14301 AGTGCAGTAG AGCTAAATCT CTTCCATGCT GTTGTTTTCC TGTCTGGGGA 
14351 AGACTTATCA TGACCATGAA AACATAATGC TGTGCCAGAA CTTAAACCCA 
14401 AACCTGTAGT AATTTCTGAG TTGAGGAAGG CTTAAGAAAT AACTGGAACT
14451 TAGTTACTAA CCTGATAGTT GTTTCTGAGT CAGAGAAATG CTCTGCATGA 
14501 TCACCTGGGA TATTTGTAAA ATCTAACCAA AGATTCTAGC CTTCCCCTTG 
14551 GATTGTGCCT CCTGTACTAC TGTAGTGCTG GCTAGGTCCT CATCAGTGAA 
14601 TTCCCTGCTG ATAGGACCAC AAAGCATCTG CCAAGATCTG TTTGCCATTT
14651 GCTGTGATTA GTGCTTCTGC TCTTTGCTTC CAATTCAACT CAGGTGGTTC 
14701 AGCCCTTCTG ACACTCCTAA TACCTCCTGT GGGATGGAAC ATAGAAGGCT
14751 TCTCACAATG ATTCACACAC TGATATGGAG ATTGAATGTC CAGTTGCAAC 
14801 TATTTTCTTC CACCTGTGTA ATTGCAGGTA CAGGGAAGTT TTCTGTGACT 
14851 GATGCTATTT TGGTTTGGAG AATGGGGTGA TGTGGCACAA TGATCTTTCT 
14901 TCTTTCTGGT CATGGATTTT TTAATTTCCA TGAACCCATA AGATTTTTCA
14951 CTTTTCTTCT GAGCTCTGGT GCTTTCAGAG TGGTATTTTT ATATTCGAAT
15001 AGTTGCTAGT TGTACTTTTA AAAGCGATTG ATGCTGGAGG TCTTCTATTC 
15051 CACCATCTCG CTGATGTCAG TCCTCAAATA ATAATTTTAT ATTTTAGCAA 
15101 ATTATTTTGG TTTTAGGATT TTGTGTCTAC GTGACACAGA CATGAAAAGA 
15151 GATGTACTCA TTACTGAAAC TTTTTGCATA CTGTTTTGGT TGTGCGCCTT 
15201 TTCTAGTATG AATGATTACA TATTTAAGCC ACATGTTTTA TACATAGACT 
15251 GTCCTTTAAA GAGACTAGAT AGTTCTGTGT GTCAGCATAT AGGGACAGAA 
15301 TATAACTACA CATTAATAAT TTCTCAAGTA TTTATTTTAG AAGTGTAAGT 
15351 AACCTTTATT TTAATTTTTG TTATATTATG CCTCTGTAAT GCAGATAAAT 
15401 TTTTATCTTC AGGAAATGGA AAATTTTGTC CAGAGTTCAG GGGAAGATGG
15451 TATTGTGGTG TTTTCTCTGG GGTCACTGTT TCAAAATGTT ACAGAAGAAA 
15501 AGGCTAATAT CATTGCTTCA GCCCTTGCCC AGATCCCACA GAAGGTCAGT 
15551 AAAACCTCCA ATCCTGATAA GCAGCTATTC ACATAATGAA ACAGTATGGT 
15601 TTTATTTGGG TCTTGAATCT CATTTTCCAC TTAGCATAAC AGGTACCAAA 
15651 ATTTGCAAAA CATTATAGTA GTGTACATGG GCATAACTGA TCATTTGCCT 
15701 ACTGAGTCTT GCTGTTACTG GAAACAACTT TCTTGATTGT CATTTGTTTA
15751 TAATAAAATA GATATAATAA ATAAAGCTCT ACCTTATATT TTAGGATTTG
15801 AAATCTAAAA GCGTGTGCCA ATGATTCCAA AAAAAAATTC TGACATCTAT 
15851 TATTTCAAAG GACCAGAAAA AGGAAAACTG ATATAAAAAA AAAAAGAAGA 
15901 ATCAATCTCA AGAATATCTT CTCATATTTG TGTGTATAAA AACTGTATTC 
15951 AGGGTAGTTT TGCTTAGAAA TAAAAGCTCA GATTAATGTA GTCTTTCTAA 
16001 ATAATTAGAA GTTTCAAAAG TAAAATGTCA ATTACAATTA TAGTATAGTA 
16051 ACAATTATTT AAGTAATGTA ATTATTTATG ATACTCCACT AATTTTAACT 
16101 TTATTATTAC TGTAATTCTA GAATTTCACA CTTTAGATAG TGCTATATAT
16151 AAACTATCCA AAAGATATTT CATTTTATAT TTAGCTAAAA TACTTCAAAC 
16201 TCAATAAAGG CAAGCATACT AATTAGGAAT TTGAAATATT GTAATTTCAA 
16251 TTATGAAATT ATCTGTTAAG TAGTTTGAAA CATCTATGCC GTTCTTTGTT 
16301 TTCAAATGTA TAAAATTTGT ATAGGTGTCC AACAAAGAAA AATTGTGTAA 
16351 AAAAAAGGTA CAATCTCAAA GAAAATTTAT CATTGAACAG TGGAACATAA 
16401 GTAATTTTCT AGCTCATTCT TCTTCAATAA AACAATTAAA TATAAGAAGA 
16451 AAGAGGCCAG GAAGGAAATA GAGAAGAAAA GACACCCGAT TATCCAAAAG 
16501 ACACACATAA TTGAAAGCAA ATTTTTATCT GCAGGGAACT GTAAATTTGA 
16551 TGGTAGAATG AGATTGGCTC CATGAGTTAA AATGACACAC AGATCAGGTA 
16601 CTTATAAAAT TTTTAATTCT TATATAAAAA TAGATTAGCC ACTGCTGAAT 
16651 TATTTTTTTA AATATTCACT GGTATTCTCA TTCTCAAATA TTTTTAATTG 
16701 GTAATAAAAT AATAATAGCA TACCTAATAG GCAACTGGTA CACATTATTT 
16751 TAAAAGATCT TTGTAAAACG TCCTACTATA TCTTTCAGTC TTTACGCGGT
16801 AGCTCTACAC ACCCCTGTCT CAACCATCAC CTGAAGTACA ATGAGTTTAT 
16851 AATTTATAAC TATATCTACA TCCTTAGAAT GCTAATATCC TGTGGTTCAC 
16901 TCTGTGAAAT ACATGTGTTT CTTCCGTAGG TGTTATGGAG GTACAAAGGA 
16951 AAAAAACCAT CCACATTAGG AGCCAATACT CGGNNNNNNN NNNNNNNNNN
17001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNAAAAAAA
17051 AAGACCCAAT CCCAAAGAAA ATTTATCATA GAACAATGGA ACATAGGTAA 
17101 TTCTCTAGCT CATTCGTCTT CAATAAAACA AATAAATATA AGAAGACAAA 
17151 GGTCAGGAAG GAAATAGAGA AGAAAAGATA ACCGATTATC CAAAATCACA 
17201 CACAAAATTG AAAGCAAATT TTATCTGTGG GGAACTGTAA ATTTGATGGT 
17251 AGAACCAGAA TAGTTCCATG ATTTGAAATG ACACAGAGAT CATGTACTTA 
17301 TAAAATATTT TATTCTTATA AGAAAATTGA GTAGCCAGTG CTGAATTACT
17351 TTTTAATGAT TCACTGATAT TCTCATACTC AGATATTTTA ATTGATATTA
17401 AAATAATAAT AGTATACTTA ATAGTCAACT GGTACACATT ATTTGAAAGG
17451 ACTTTTGTAA AAAGTCCTAC TATGTCTTTT ACTGTTTACA CAGTACCTCT
17501 ACATACCCCT GTCTCAACCA ACACCTGAAG TACAATGAGT TTATAATTTA
17551 TAACTATATC TACATCCTTA GAGTGCTAAT ATCCTGTGGT TCAATCTGTG
17601 AAATACATGT GTTTCTTCCA TAGGTGTTAT AGAGATACAA TGGAAAAAAA
17651 ACCATCCACA TTAGGAACCA ATACTCGGCT GTATGATGGG ATACCCCAGA
17701 ATGATCTTCT TGGTAGGTCT ATGAGAAAGT AAAAATATGA ACTAGACGAG 
17751 GAAAAAATGA ATAAATGTTA AACAGCAAGC AAATTCAGCA AAGATCTAAA
17801 ATTATAAAAC TTTATTTTAC TTACTCTTTT GAAGCAGATA TAATTAAAGG
17851 ATTGACTAAA ATTGTATAGA TTCACACTTT CTATTGTTAA GGTGAGAGTG
17901 ACAGGAAATT CAGAAGGAAT TAATGCCTAT TTTTCTGGAG ATAGAAATGA
17951 TCTTTAGTAG CAATGCTCCA TGTGCTCACC TTCTAAAGAA AGTGCTGTAC
18001 GCTTCAGTGA GTTATCTCGT AATTCCCATC TGTAGTTTTT AAATAATTTT 
18051 AAAAGTTTAG AATAAAATAT CTCACCATTT CTCATCCAAT TTACATACTA 
18101 GGTCATCCCA AAACCAAAGC TTTTATCACT CATGGTGGAA TGAATGGGAT 
18151 CTATGAAGCT ATTTACCATG GGGTCCCTAT GGTGGGAGTT CCCATATTTG 
18201 GTGATCAGCT TGATAACATA GCTCACATGA AGGCCAAAGG AGCAGCTGTA 
18251 GAAATAAACT TCAAAACTAT GACAAGCGAA GATTTACTGA GGGCTTTGAG 
18301 AACAGTCATT ACCGATTCCT CGTAAGTACT ACTGCTTGTA CAGACTGATC 
18351 TAACATTGAC TATGTTATAC ATTATACCAG AAAATGTTAA ATATCATCCT 
18401 GGTAGACATG TTGAGGGATT TTACTCCACA ATATTGAGTC ATTCATCACC 
18451 TTGTTACTGG AATAGTTGTG GAAATTGTAG TTCATAGAGT GTCAAACTTT 
18501 CTTCATGGAA ATATTAGGTT TAAGTTAACA ACTGGCTTAC TAAGCTTTTA 
18551 TTCACATCTT AATTTTACCC CATTTTGTTA AGAATATACT CTTTCAGTCT 
18601 CTCCACTATA TCTGTTTAAT ACTATGTAAC CAACAATATT CATGTCACAA 

18651 CCAGAATCAA TCTTTTACTG AACATGTTCT TGGCTTGCAT AACATATACT
18701 ACGGTTTATC TACGTGTCTT TTATGAAAAC AAAACTACAA CTTTCTAAGT 
18751 TCTATGTGTG TTTTTCCCTT CCAGTTATAA AGAGAATGCT ATGAGATTAT 
18801 CAAGAATTCA CCATGATCAA CCTGTAAAGC CCCTAGATCG AGCAGTCTTC 
18851 TGGATCGAGT TTGTCATGCG CCACAAAGGA GCCAAGCACC TGCGATCAGC 
18901 TGCCCATGAC CTCACCTGGT TCCAGCACTA CTCTATAGAT GTGATTGGGT
18951 TCCTGCTGAC CTGTGTGGCA ACTGCTATAT TCTTGTTCAC AAAATGTTTT
19001 TTATTTTCCT GTCAAAAATT TAATAAAACT AGAAAGATAG AAAAGAGGGA
19051 ATAGATCTTT CCAAATTCAA GAAAGACCTG ATGGGGTAAT CCTGTTAATT
19101 CCAGCCACAT AGAATTTGGT GAAAACCTTG CTATTTTCAT ATTATCTATT
19151 CTGTTATTTT ATCTTAGCTA TATAGCCTAG AATTCCACGA TCATGAGGTT
19201 GTGAGTATAT CTCATTCTTT CGTTGTATTT TCCTAGGTGT CTTTACTCTC
19251 TTCTCTCACT TTGTGACACA AGGACATGAA TACATCTAAA TTTTCCTATT
19301 TCTGATATGA CTGTTTTGAT GATGTCATTA CTTCTATAAC CTTAAGTGAT
19351 AGGGTGACAT GCAATATGAT TATTCCTGGT GTGCGCCCAA ACACATGGAT
19401 ATAAAGAGGT AAAAAACTTA AAATTCACAA AATTCAGTAA ACCACACAAA
19451 TCAGGTAAGT GTTCTATGAG ATTAGCTGGC TATGAGAAAC ATAATGATGT
19501 TTCTTTTTCA ATTTAAATAA GCCCTTCTAC ATAGCCAGCA TCAGTGATCT
19551 CAGAAAATAA ATTGCTAATA ATGATGACAT GGCATTATGC TTAGAAAAGT
19601 TTGCTGTATT TCCATAGACC TCATCTAGAT GTCATGGCCT ACATTTCTGC
19651 CATCACTCAA CCAATACTTT TTTCTGTTTT CTTGATGATA AAAAGACCTT
19701 TCTCATGATT GCCATCAAAT AACAAAAGAA ACTATTTTTT TTCTCACATA
19751 GAGAACATGT CAGTAAGATA TTCAAGGTGA ACAGATATTT TTGGGATTAG
19801 TAACTATTTG AAATATGTGG TGATAATTAC TGAGTTTATA AAATTTATTT
19851 GATAGTACAC TTAAAGAAGA TTTATATGTT TATTCTTTAA AAATGATGAA
19901 TACTCATAAT TCTTATCTCT ATAATCAAAA GTATAATTTA CTGTAGAAAA
19951 ATAAAGAGAT GCTTGTTCTG AAAGTAAGAT CAGTGAACTG CTTTTCAGTC
20001 TCAATCTTTG AGAATTGTAA ATTCATCAAA TAATTGCTTA CATAGTAAAA
20051 ATTTAAGGTA TTAGAAAACC TGCATAACAA ATAGTATTAT ATATTAAATA
20101 TTTTGATATG TAAAGCTCTA CACAAAGCTA AATATAGTGT AATAATGTTT
20151 ACACTAGTAA GCAAATATGT TAATCTTCTC ATTTTTTTAC TGTCATATAA
20201 TCTTAGTGAT ATGCCTATTA ATAGTTTTAA ATAAATAAAT TGGCTTATCT
20251 GGCTTTTTGA AAATTTTGAA ATTCTTACAG ATGTTGATTA GGTATATCTA
20301 CAAATTAATT TCAATTTTAA AATGATGATA TAAAAATAAA TATAAGTATT
20351 TTTCTTGTGT ATGTATACAA TAAATATAAA TAAAATTGTT TACTGTTTTG
20401 AAAGTTTCTT AAGTTTTTAC ACTGATATGT TTTTTGACTT TTACAATATT
20451 ATTATAATCT AGGAAAAGCT GATTATATCT GTTTTAAGCC TCATCTTTTC
20501 TCTGTAATTA AACACAGTAA TTTATTAACA TGCTGTGACA GGTGGGAAGC
20551 CATTTCTGGA GTTGAGCCTG CTGACACTCT GGAGCTTTTT AGGTTGGACG
20601 TTCATTGTAT GTGGGACTCT CTGCCTCTCG ATAGCTGTTG CTCATAAGAC
20651 TCTCCTTCAT CAATCTGGCA TTGAATTTTG CGATCAGTTG CAATCAGAAT
20701 CCAATTGGCC TTGCCGTTTT AGTATGTTCT ATCTTAACCA GCAATTTCTA
20751 ACCAGGAGCC TGCCCAGGTT TGTTCTGTCT TCCCTGTAAG AAGCTCCCAG
20801 CATAAATATT CTAAATTTTA CACTACTAAT CTATTAACCA ACCTTTGGAC
20851 CATGTTCACT TTAGGTTGAG CATAGTGTGA TGAGATGCAA ATTAAATTAC
20901 AATCCTATAG GTGTGTGTTA TAAATTTTAA AGTGTATAAA TTAAATAACA
20951 CATTCTAAGT ATCCAACAAA GGTCAAAAAA ATGATATAAA GTCACCAAAC (SEQ ID NO: 3)

FEATURES

Start:   197
Exon:    197-911
Intron:  912-6498
Exon:    6499-6647
Intron:  6648-15412
Exon:    15413-15544
Intron:  15545-16929
Exon:    16930-16940
Intron:  16941-17632
Exon:    17633-17712
Intron:  17713-18101
Exon:    18102-18321
Intron:  18322-18774
Exon:    18775-19051
Stop:    19052

CHROMOSOME MAP POSITION:

Chromosome 4

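The FEATURES coordinates read as 1-based, inclusive ranges in which each exon is followed directly by the next intron. A minimal Python sketch of how such a table could be sanity-checked follows. The tuple layout and the helper names `feature_length`, `find_breaks`, and `slice_feature` are assumptions made for illustration, and only the first five ranges of the table are copied in.

```python
from typing import List, Tuple

# (kind, start, end) with 1-based, inclusive coordinates.
# This layout is an assumption of the sketch, not part of the listing.
Feature = Tuple[str, int, int]

# First five ranges copied from the FEATURES table.
FEATURES: List[Feature] = [
    ("Exon", 197, 911),
    ("Intron", 912, 6498),
    ("Exon", 6499, 6647),
    ("Intron", 6648, 15412),
    ("Exon", 15413, 15544),
]


def feature_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive range."""
    return end - start + 1


def find_breaks(features: List[Feature]) -> List[Tuple[int, int]]:
    """Return (previous_end, next_start) for every pair of consecutive
    features that do not abut, i.e. where next_start != previous_end + 1."""
    breaks = []
    for (_, _, prev_end), (_, next_start, _) in zip(features, features[1:]):
        if next_start != prev_end + 1:
            breaks.append((prev_end, next_start))
    return breaks


def slice_feature(sequence: str, start: int, end: int) -> str:
    """Extract a 1-based, inclusive range from a 0-based Python string."""
    return sequence[start - 1:end]
```

An empty list from `find_breaks` means every range starts one base after the previous one ends; a non-empty list points at a boundary worth re-reading in the table.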
ALLELIC VARIANTS (SNPs):

DNA Position    Major    Minor    Domain
1735            A        G        Intron
1922            A        G        Intron
2361            C        T        Intron
7371            G        C        Intron
9558            G        A        Intron
10579           T        G A      Intron
10625           C        T        Intron
11147           A        G        Intron
15131           C        T G      Intron
15221           A        G T      Intron
15778           T        C        Intron
15895                    A        Intron
19786                    T        Beyond
20157           G        A        Beyond
20246           T        C        Beyond
20681           C        A        Beyond
20819           T        C        Beyond
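The domain assigned to each variant can be cross-checked against the FEATURES coordinates. The sketch below is a hedged illustration in Python: `classify_position` is a hypothetical helper, the feature list is truncated to the ranges up to position 16929, and treating every position after the Stop coordinate (19052) as "Beyond" is an assumption based on how the table labels positions 19786 and later.

```python
from typing import List, Optional, Tuple

# (kind, start, end), 1-based inclusive; truncated copy of the FEATURES table.
FEATURES: List[Tuple[str, int, int]] = [
    ("Exon", 197, 911),
    ("Intron", 912, 6498),
    ("Exon", 6499, 6647),
    ("Intron", 6648, 15412),
    ("Exon", 15413, 15544),
    ("Intron", 15545, 16929),
]

STOP = 19052  # "Stop" coordinate from the FEATURES table


def classify_position(
    position: int,
    features: List[Tuple[str, int, int]] = FEATURES,
    stop: int = STOP,
) -> Optional[str]:
    """Return 'Beyond' for positions after the stop coordinate, the kind of
    the feature containing the position otherwise, or None when the
    (truncated) feature list does not cover the position."""
    if position > stop:
        return "Beyond"
    for kind, start, end in features:
        if start <= position <= end:
            return kind
    return None


# Usage: a few positions taken from the variant table.
for pos in (1735, 15131, 15778, 20157):
    print(pos, classify_position(pos))
```

Because the list is truncated, `None` here only means "not covered by this sketch", not "unannotated in the listing".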



Context:

DNA Position

1735 TACTTTACTGATATACACCTATTTGACTTATTTTTATTTCTTTGCTTTACTGATAAAAAG 
TTGTTTTGCTTTGCAATTTTCATATAGTTGTGATCAGAGCTGGTCAATGCAAGACATGTT 
TTTATCCAAATATGTTTGAGAATTATGTAGAAACATGAAAAAAGGTACAATTATATCCGA 
CACTAAAATATTGTTTAATGTATTCCAACGAATTCTTATGCATAGACTGTTTCACAGAAC 
TAATATTCAGAGGATCCCAGTTCAAATGTCCTTAGCCTTAGACATGATTTGAATTTACAT 
[A,G] 

TATTGATTTGCTTTAAATAATTTTCCATTCAGTAAGCTGTGCCTAGCTGCAGATAGCCTA 
CCAGGCTTTATGGATCTAGGTAAACAATACAAATCTCTTGGCCTCAAGTCTACATTCAGA 
TATTAATTTAAAGGGGTACAGCTATATAGAGGTCACTGGCAAATTTTGGTAAAATAGGAT 
TATAGTAAAAGCCCCCTGACAAGATTGAAATTTAAAATAAAACAAAAGTGTTATCAAAGG 
GGTGAAAGAGCATTTTCCAATAAACAAAAGTGGGTTCTGGCCATGCATTCAGAAATTCCC 



1922 ATATTGTTTAATGTATTCCAACGAATTCTTATGCATAGACTGTTTCACAGAACTAATATT 
CAGAGGATCCCAGTTCAAATGTCCTTAGCCTTAGACATGATTTGAATTTACATATATTGA 
TTTGCTTTAAATAATTTTCCATTCAGTAAGCTGTGCCTAGCTGCAGATAGCCTACCAGGC 
TTTATGGATCTAGGTAAACAATACAAATCTCTTGGCCTCAAGTCTACATTCAGATATTAA 
TTTAAAGGGGTACAGCTATATAGAGGTCACTGGCAAATTTTGGTAAAATAGGATTATAGT 
[A,G] 

AAAGCCCCCTGACAAGATTGAAATTTAAAATAAAACAAAAGTGTTATCAAAGGGGTGAAA 
GAGCATTTTCCAATAAACAAAAGTGGGTTCTGGCCATGCATTCAGAAATTCCCCAACAAT 
TCTTTAAAAATCATGGAGCAGCTTGATATATAAGAAATTCATTTAATAACTATATTTATT 
ATGTAGCTCCAACTTACTAAATTATTGATTATTATATATTTTATAGAATTATCTATTGTG 
AGTCTAAATCAAGAGTATATATTCAAACAACTATAGGAAAAGGGATATCAGTCAATTTCA 



2361 CAGCTTGATATATAAGAAATTCATTTAATAACTATATTTATTATGTAGCTCCAACTTACT
AAATTATTGATTATTATATATTTTATAGAATTATCTATTGTGAGTCTAAATCAAGAGTAT 
ATATTCAAACAACTATAGGAAAAGGGATATCAGTCAATTTCAATTCAAGGATTTATTTCC 
ATAAGTGCTTACGCACAGGTGTATTTCATTTTATTATACATTGCTTTATTGTCCTTCACA 
AAAATTGCAATTTACAAATTAAAGGTTTTTGAAAACCTTGAATCAAGCTAATCAATTTGG 
[C,T] 

GTAATATTTCCAACAACAAGTGTGTACTTTTGACTCTATCACATATTGGCATTTATCATG 
CTTTTTCAAATTTTTCATTGTTATATCTGTTACGGTGATCTGGGATCAGTGTTCCTTGAT 
GGTTACACGTTTATTAGCTTGGGGGCACCTTGATGTGTTACAATATAAGACAGCAAACTT 
AATTATAAATGTTGTGCATGTACTAACTGCTCCGCTGATTCGTTTCCCCATCCCACTTCT 
TCTTAGGCCTCCCTATTCCCTGAGACACAGTAATATAACATACAATGACTTCTAAATGTT 

7371 AAATAAGTGATGAAGGAAGACAAAGAATAAAAGAAAATTTCAATAAATGGAAAGTTTAAG
TGTTTAATGATAGTGATGACTTTTACTCAAATAAGTGCTTAGAAGTCATCTTGTTTGTGA 
TTTATATGATGAATTCTGTGTTGTGACTATCCACTTTGAGCTCGTGAGAATGTTAGGTGA 
GGTTTAATAAAAGCCATTTGAGAAAAACAAGGTTTCAACCTCTGTGGACAGAAATCTAAA 
TATCGATAGTTATCAGGACAAAGTAGAGCTCATAGAAATMTTTTGCAGCCTGCAGGTTT 

[G,C] 

TTTTGGAGTGAAAATAAAATTGTATACTATATTCCTAAATCATCAGAGGAAAAAATTTAT 
AGTTCAAGGAATGTTGAAAGAAACAATATTGAGAAGTAAAAGTGAGTAATAGTTGTTATA 
GTTTTTTAATAGTTTTGTAAGTATGTCTTGAGTTCACTGTCCCAAAAGTGGCTATTAGCT 
CTAGCCTTGACCTGACAAGGTTCTAGGATATTTAGTCATGGATGTTCATAATCTACCTCT 
TACGGGATACTTTTTATTCTGATGAACAGCCTAATGCCTAAGTGTGCAATCTATACCAAG 

9558 TCCACATATAAGGGATACCATGCTATCTCTGCCTGGATTATTTCAGTTAACATTATGCCC 
TGGAGGTTCATTCATGTTTCTACAAATGACAGGATTTCATTCTTTTTTTTCCAATATATA 
TTTAATGAAATGGATATATATAAACATTGGAAAATGTATATATATATATATATCTCCAGT 
GGAATGCTATTGAGCTATAAAAAAGTTAATATATAATAGAAATAAAGCTTATATATATCT 
AATGGAATGGATATATATATATAATGGAATAGAAATATATATCTATACATATAAACACAC 
[G,A] 

CAATATACATATCCATTTCATTGCATATATATATATATAGAGAGAGAGAGAGAGAGATAT 
TTTCAAATGTGTGTATATATATCCAATGGAATGGACATATATATATGTATATTTTTTCCA 
TATTTTCTTTATGTATTTCTTCATTAATGGATGTTTAGGTTGATTCATCCCTTGGGTATA 
TGAATAATGTTGATGTAAACATAGAAGGACAGATATCTCTATGACTTCTTAGTTTATTTA 
AATATACACCCAGTAATGGAAATGCTGTATAATATGGTAGTTCTATTTTCATTTTTTGAG 

10579 CAATGCTTTGATATAACCCATTTATCTACTTTTCCTTTTGTTGGCTGTGCTTCTGAAGTC
CTATCCAAAAAAATCCTTGCCTAGACCAATGTCACAAATCATTCCTCCTACAGTTTCTTC 
TAGTAGTTGTATAATGTTTGGCCTTATATTTAACTTTGTAATTCATTTTTACTTACTTTG 
TATATGGTGAGGGATAGAGGTCTAGTTTCATTTTCTGCATGTGGATATGCAGTTTTCCTA 
GCACCATTTAGTGAAGAGGTTGCCTTTTTTCTATTATGTGTTCTTGGCACCTTTGTCAAA 
[T,G,A] 

GTCAGTTAGCTGCTATATTCCTCCATTTGTGTTGTTATAGAGGAACACATGAGACTAGCA 
AATTTATATATCAAATAGAATTATTTGAATGATAGTTCTGCATACTGTACAAGAAGCACA 
GCACTGACTTCTGCTTGGCCTCTGGTAAGGTTCTCAAGATGCTTCCACTTGTGGTAGAAG 
GCAAACATGAGCTGGTATATGCAAAGGTCTCATGACAAGAGAGGAAACCATAAAGAGGGG 
ATGTGAGGGAGTGCCAGGTTTTGTAAAACAACTAGCTCTTCTGGGAACTAATAGAGTAAA 

10625 GTGCTTCTGAAGTCCTATCCAAAAAAATCCTTGCCTAGACCAATGTCACAAATCATTCCT 
CCTACAGTTTCTTCTAGTAGTTGTATAATGTTTGGCCTTATATTTAACTTTGTAATTCAT 
TTTTACTTACTTTGTATATGGTGAGGGATAGAGGTCTAGTTTCATTTTCTGCATGTGGAT 
ATGCAGTTTTCCTAGCACCATTTAGTGAAGAGGTTGCCTTTTTTCTATTATGTGTTCTTG 
GCACCTTTGTCAAAAGTCAGTTAGCTGCTATATTCCTCCATTTGTGTTGTTATAGAGGAA 
[C,T] 

ACATGAGACTAGCAAATTTATATATCAAATAGAATTATTTGAATGATAGTTCTGCATACT 
GTACAAGAAGCACAGCACTGACTTCTGCTTGGCCTCTGGTAAGGTTCTCAAGATGCTTCC 
ACTTGTGGTAGAAGGCAAACATGAGCTGGTATATGCAAAGGTCTCATGACAAGAGAGGAA 
ACCATAAAGAGGGGATGTGAGGGAGTGCCAGGTTTTGTAAAACAACTAGCTCTTCTGGGA 
ACTAATAGAGTAAAAATTCGCCTCCCAGGCAGGGGATTAATCTATTCATGAGGGATCTGC 

11147 ACAACTAGCTCTTCTGGGAACTAATAGAGTAAAAATTCGCCTCCCAGGCAGGGGATTAAT 
CTATTCATGAGGGATCTGCTTCCATGACAAAGGCACATTCTGTTAGATTCTACCCCCAAT 
ATTGGGGATCAAATTTTAACATGAAGTGTGGAGGGCTCAAATATCCATACTATGGCAGCA 
GTAAATGCATAAATTTATTTTGTGGATCTCTATTCTATATAGTATTGGTGTATGTATCTG 
TTTTCATGCCACTGCCATACTGTTTTGGTGATGATATCTATGCTATATATGTGTGTGTGT 

[A,G] 

TATATATATTATATATATGTATATATGTGTATATTATATATATGTATATATGTGTATATT 
ATATATATATAATACTTTAAGTTTTATATATATATAAAATACTTTAAGTTCAAGGGTACA 
TGTGCAGGATGTGCAGGTCAGTTACATAGGTATACATGTGCCATTTTGGTTTGCTGCATG 
CATCAACTCATCATTACATTAGGTATTTCTCCTAATGCTATCCCTCCACCAGCCACCCAA 
CCCCCAACAGGCCAGGTGTGTGATGTTCCCCGCCCTGTGTCCATGTGTTCTCATTGTTCA 

15131 CAGGGAAGTTTTCTGTGACTGATGCTATTTTGGTTTGGAGAATGGGGTGATGTGGCACAA 
TGATCTTTCTTCTTTCTGGTCATGGATTTTTTAATTTCCATGAACCCATAAGATTTTTCA 
CTTTTCTTCTGAGCTCTGGTGCTTTCAGAGTGGTATTTTTATATTCGAATAGTTGCTAGT 



FIGURE 3, page 9 of 11 



TGTACTTTTAAAAGCGATTGATGCTGGAGGTCTTCTATTCCACCATCTCGCTGATGTCAG 
TCCTCAAATAATAATTTTATATTTTAGCAAATTATTTTGGTTTTAGGATTTTGTGTCTAC 

[C,T,G]

TGACACAGACATGAAAAGAGATGTACTCATTACTGAAACTTTTTGCATACTGTTTTGGTT 
GTGCGCCTTTTCTAGTATGAATGATTACATATTTAAGCCACATGTTTTATACATAGACTG 
TCCTTTAAAGAGACTAGATAGTTCTGTGTGTCAGCATATAGGGACAGAATATAACTACAC 
ATTAATAATTTCTCAAGTATTTATTTTAGAAGTGTAAGTAACCTTTATTTTAATTTTTGT 
TATATTATGCCTCTGTAATGCAGATAAATTTTTATCTTCAGGAAATGGAAAATTTTGTCC 

15221 TTAATTTCCATGAACCCATAAGATTTTTCACTTTTCTTCTGAGCTCTGGTGCTTTCAGAG 
TGGTATTTTTATATTCGAATAGTTGCTAGTTGTACTTTTAAAAGCGATTGATGCTGGAGG 
TCTTCTATTCCACCATCTCGCTGATGTCAGTCCTCAAATAATAATTTTATATTTTAGCAA 
ATTATTTTGGTTTTAGGATTTTGTGTCTACGTGACACAGACATGAAAAGAGATGTACTCA 
TTACTGAAACTTTTTGCATACTGTTTTGGTTGTGCGCCTTTTCTAGTATGAATGATTACA 

[A,G,T]

ATTTAAGCCACATGTTTTATACATAGACTGTCCTTTAAAGAGACTAGATAGTTCTGTGTG 
TCAGCATATAGGGACAGAATATAACTACACATTAATAATTTCTCAAGTATTTATTTTAGA 
AGTGTAAGTAACCTTTATTTTAATTTTTGTTATATTATGCCTCTGTAATGCAGATAAATT 
TTTATCTTCAGGAAATGGAAAATTTTGTCCAGAGTTCAGGGGAAGATGGTATTGTGGTGT 
TTTCTCTGGGGTCACTGTTTCAAAATGTTACAGAAGAAAAGGCTAATATCATTGCTTCAG 

15778 GTTTCAAAATGTTACAGAAGAAAAGGCTAATATCATTGCTTCAGCCCTTGCCCAGATCCC 
ACAGAAGGTCAGTAAAACCTCCAATCCTGATAAGCAGCTATTCACATAATGAAACAGTAT 
GGTTTTATTTGGGTCTTGAATCTCATTTTCCACTTAGCATAACAGGTACCAAAATTTGCA 
AAACATTATAGTAGTGTACATGGGCATAACTGATCATTTGCCTACTGAGTCTTGCTGTTA 
CTGGAAACAACTTTCTTGATTGTCATTTGTTTATAATAAAATAGATATAATAAATAAAGC 

[T,C] 

CTACCTTATATTTTAGGATTTGAAATCTAAAAGCGTGTGCCAATGATTCCAAAAAAAAAT 
TCTGACATCTATTATTTCAAAGGACCAGAAAAAGGAAAACTGATATAAAAAAAAAAAGAA 
GAATCAATCTCAAGAATATCTTCTCATATTTGTGTGTATAAAAACTGTATTCAGGGTAGT
TTTGCTTAGAAATAAAAGCTCAGATTAATGTAGTCTTTCTAAATAATTAGAAGTTTCAAA
AGTAAAATGTCAATTACAATTATAGTATAGTAACAATTATTTAAGTAATGTAATTATTTA

15895 TATGGTTTTATTTGGGTCTTGAATCTCATTTTCCACTTAGCATAACAGGTACCAAAATTT 
GCAAAACATTATAGTAGTGTACATGGGCATAACTGATCATTTGCCTACTGAGTCTTGCTG 
TTACTGGAAACAACTTTCTTGATTGTCATTTGTTTATAATAAAATAGATATAATAAATAA 
AGCTCTACCTTATATTTTAGGATTTGAAATCTAAAAGCGTGTGCCAATGATTCCAAAAAA 
AAATTCTGACATCTATTATTTCAAAGGACCAGAAAAAGGAAAACTGATATAAAAAAAAAA

GAAGAATCAATCTCAAGAATATCTTCTCATATTTGTGTGTATAAAAACTGTATTCAGGGT 
AGTTTTGCTTAGAAATAAAAGCTCAGATTAATGTAGTCTTTCTAAATAATTAGAAGTTTC 
AAAAGTAAAATGTCAATTACAATTATAGTATAGTAACAATTATTTAAGTAATGTAATTAT 
TTATGATACTCCACTAATTTTAACTTTATTATTACTGTAATTCTAGAATTTCACACTTTA 
GATAGTGCTATATATAAACTATCCAAAAGATATTTCATTTTATATTTAGCTAAAATACTT 

19786 GAAACATAATGATGTTTCTTTTTCAATTTAAATAAGCCCTTCTACATAGCCAGCATCAGT 
GATCTCAGAAAATAAATTGCTAATAATGATGACATGGCATTATGCTTAGAAAAGTTTGCT 
GTATTTCCATAGACCTCATCTAGATGTCATGGCCTACATTTCTGCCATCACTCAACCAAT 
ACTTTTTTCTGTTTTCTTGATGATAAAAAGACCTTTCTCATGATTGCCATCAAATAACAA 
AAGAAACTATTTTTTTTCTCACATAGAGAACATGTCAGTAAGATATTCAAGGTGAACAGA 

ATTTTTGGGATTAGTAACTATTTGAAATATGTGGTGATAATTACTGAGTTTATAAAATTT 
ATTTGATAGTACACTTAAAGAAGATTTATATGTTTATTCTTTAAAAATGATGAATACTCA
TAATTCTTATCTCTATAATCAAAAGTATAATTTACTGTAGAAAAATAAAGAGATGCTTGT
TCTGAAAGTAAGATCAGTGAACTGCTTTTCAGTCTCAATCTTTGAGAATTGTAAATTCAT
CAAATAATTGCTTACATAGTAAAAATTTAAGGTATTAGAAAACCTGCATAACAAATAGTA

20157 ACACTTAAAGAAGATTTATATGTTTATTCTTTAAAAATGATGAATACTCATAATTCTTAT 
CTCTATAATCAAAAGTATAATTTACTGTAGAAAAATAAAGAGATGCTTGTTCTGAAAGTA 
AGATCAGTGAACTGCTTTTCAGTCTCAATCTTTGAGAATTGTAAATTCATCAAATAATTG 
CTTACATAGTAAAAATTTAAGGTATTAGAAAACCTGCATAACAAATAGTATTATATATTA 
AATATTTTGATATGTAAAGCTCTACACAAAGCTAAATATAGTGTAATAATGTTTACACTA 
[G,A] 

TAAGCAAATATGTTAATCTTCTCATTTTTTTACTGTCATATAATCTTAGTGATATGCCTA
TTAATAGTTTTAAATAAATAAATTGGCTTATCTGGCTTTTTGAAAATTTTGAAATTCTTA 
CAGATGTTGATTAGGTATATCTACAAATTAATTTCAATTTTAAAATGATGATATAAAAAT 
AAATATAAGTATTTTTCTTGTGTATGTATACAATAAATATAAATAAAATTGTTTACTGTT 
TTGAAAGTTTCTTAAGTTTTTACACTGATATGTTTTTTGACTTTTACAATATTATTATAA 



20246 GAAAAATAAAGAGATGCTTGTTCTGAAAGTAAGATCAGTGAACTGCTTTTCAGTCTCAAT
CTTTGAGAATTGTAAATTCATCAAATAATTGCTTACATAGTAAAAATTTAAGGTATTAGA 
AAACCTGCATAACAAATAGTATTATATATTAAATATTTTGATATGTAAAGCTCTACACAA 
AGCTAAATATAGTGTAATAATGTTTACACTAGTAAGCAAATATGTTAATCTTCTCATTTT 
TTTACTGTCATATAATCTTAGTGATATGCCTATTAATAGTTTTAAATAAATAAATTGGCT 
[T,C] 

ATCTGGCTTTTTGAAAATTTTGAAATTCTTACAGATGTTGATTAGGTATATCTACAAATT 
AATTTCAATTTTAAAATGATGATATAAAAATAAATATAAGTATTTTTCTTGTGTATGTAT
ACAATAAATATAAATAAAATTGTTTACTGTTTTGAAAGTTTCTTAAGTTTTTACACTGAT 
ATGTTTTTTGACTTTTACAATATTATTATAATCTAGGAAAAGCTGATTATATCTGTTTTA 
AGCCTCATCTTTTCTCTGTAATTAAACACAGTAATTTATTAACATGCTGTGACAGGTGGG 

20681 TAAAATTGTTTACTGTTTTGAAAGTTTCTTAAGTTTTTACACTGATATGTTTTTTGACTT 
TTACAATATTATTATAATCTAGGAAAAGCTGATTATATCTGTTTTAAGCCTCATCTTTTC 
TCTGTAATTAAACACAGTAATTTATTAACATGCTGTGACAGGTGGGAAGCCATTTCTGGA 
GTTGAGCCTGCTGACACTCTGGAGCTTTTTAGGTTGGACGTTCATTGTATGTGGGACTCT 
CTGCCTCTCGATAGCTGTTGCTCATAAGACTCTCCTTCATCAATCTGGCATTGAATTTTG 
[C,A] 

GATCAGTTGCAATCAGAATCCAATTGGCCTTGCCGTTTTAGTATGTTCTATCTTAACCAG 
CAATTTCTAACCAGGAGCCTGCCCAGGTTTGTTCTGTCTTCCCTGTAAGAAGCTCCCAGC 
ATAAATATTCTAAATTTTACACTACTAATCTATTAACCAACCTTTGGACCATGTTCACTT 
TAGGTTGAGCATAGTGTGATGAGATGCAAATTAAATTACAATCCTATAGGTGTGTGTTAT 
AAATTTTAAAGTGTATAAATTAAATAACACATTCTAAGTATCCAACAAAGGTCAAAAAAA 

20819 AATTTATTAACATGCTGTGACAGGTGGGAAGCCATTTCTGGAGTTGAGCCTGCTGACACT 
CTGGAGCTTTTTAGGTTGGACGTTCATTGTATGTGGGACTCTCTGCCTCTCGATAGCTGT 
TGCTCATAAGACTCTCCTTCATCAATCTGGCATTGAATTTTGCGATCAGTTGCAATCAGA 
ATCCAATTGGCCTTGCCGTTTTAGTATGTTCTATCTTAACCAGCAATTTCTAACCAGGAG 
CCTGCCCAGGTTTGTTCTGTCTTCCCTGTAAGAAGCTCCCAGCATAAATATTCTAAATTT 
[T,C] 

ACACTACTAATCTATTAACCAACCTTTGGACCATGTTCACTTTAGGTTGAGCATAGTGTG 
ATGAGATGCAAATTAAATTACAATCCTATAGGTGTGTGTTATAAATTTTAAAGTGTATAA 
ATTAAATAACACATTCTAAGTATCCAACAAAGGTCAAAAAAATGATATAAAGTCACCAAA 
C 
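Each Context entry lists the sequence on the 5' side of the variant, the alleles in square brackets, and the sequence on the 3' side. The Python sketch below shows one way such an entry could be turned into per-allele sequences and compared with the genomic sequence. The helper names `parse_alleles`, `allele_sequences`, and `flanks_match` are hypothetical, and the short flank strings in the usage lines are excerpts of the entry for position 20681, not the full flanks.

```python
import re
from typing import Dict, List


def parse_alleles(token: str) -> List[str]:
    """Split an allele token such as '[C,A]' or '[ C T G ]' into its alleles.
    Commas and whitespace are both accepted as separators."""
    inner = token.strip().strip("[]")
    return [allele for allele in re.split(r"[,\s]+", inner) if allele]


def allele_sequences(flank5: str, alleles: List[str], flank3: str) -> Dict[str, str]:
    """Build one sequence per allele by joining 5' flank, allele, 3' flank."""
    return {allele: flank5 + allele + flank3 for allele in alleles}


def flanks_match(genomic: str, position: int, flank5: str, flank3: str) -> bool:
    """Check that the flanks sit directly before and after a 1-based
    position in the genomic sequence."""
    index = position - 1  # convert to a 0-based string index
    before = genomic[max(0, index - len(flank5)):index]
    after = genomic[index + 1:index + 1 + len(flank3)]
    return before == flank5 and after == flank3


# Usage with excerpts around position 20681 ([C,A]).
alleles = parse_alleles("[C,A]")
variants = allele_sequences("GAATTTTG", alleles, "GATCAGTTG")
print(variants["C"])
print(variants["A"])
```

Accepting both commas and spaces in `parse_alleles` is a design choice that tolerates allele tokens whose separators were altered during extraction.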